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ANGIOPOIETIN RELATED GENE SEQUENCE SCARFACE 1 

5 BACKGROUND OF THE INVENTION 

This invention relates to recombinant DNA technology. 
In particular the invention pertains to angiogenesis related 
genes, their encoded proteins, and to pharmaceutical and 
other uses thereof • 

10 Identifying the role of specific proteins in 

development and differentiation of higher organisms, 
including humans, has become a major challenge for modern 
molecular biology. In this regard the study of simpler 
organisms such as Drosophlla melanogaster has greatly 

15 facilitated progress in this area. For exaziple, a secreted 
protein termed **Scabrous" in Drosophlla, plays an important 
role in defining positional aspects of developing neurogenic 
regions. Homology comparisons reveal that Scabrous is 
related to the angiopoietin family of proteins, which are 

20 necessary for normal angiogenesis in mcuranals. 

Angiogenesis begins during embryonic development auid 
continues throughout the life of the organism. During 
embroyonic development, a primitive vascular network forms 
by processes that involve regulated proliferation, 

25 differentiation, migration, and association of endothelial 
cells. Subsequent angiogenic processes remodel this primary 
network to form a mature cardiovascular system. 

Angiogenic remodeling occurs throughout life in an 
adult organism, for exaxi^le, during tissue repair and the 
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menstrual cycle. Angiogenesis is also critically important 
during the proliferation and metastasis of tiimor cells. 

Mechanistically, angiogenesis involves multiple 
proteins, including tyrosine kinase receptors and protein 
5 ligands that bind the receptors to stimulate or inhibit 
angiogenesis. See e.g. J. Polkman and P. D'Amore, Cell 87, 
1153-1155, 1996; S. Davies et al. Cell, 87, 1161-1169, 1996, 
herein incorporated by reference. The first protein to be 
identified in this process was dubbed «angiopoietin 1" (See 
10 S. Davis et al . Cell, 87, 1161-1169, 1996). Angiopoietin 1 
comprises a 70 KD glycosylated protein of 498 amino acid 
residues in human and mouse. There are at least two 
recognizable domains on this protein, one a fibrinogen- like 
region at the carboxyl end, and the other a coiled- coil 
15 domain at the amino terminus. Coiled- coil regions typically 
are involved in the assembly of a protein into multimeric 
structures. It is thought that the coiled-coil region of 
angiopoietin 1 may serve to assemble the fibrinogen- like 
domain into a multimeric structure. The fibrinogen- like 
20 domain of angiopoietin 1 and other related molecules is 
probably the receptor binding portion for this class of 
molecules (See Valenzuela et al. Proc. Nat. Acad. Sci. 96, 
1904-1909, 1999) . 

Angiopoietin 1 appears to function in angiogenesis by 
25 binding the Tie2 receptor and promoting tyrosine 

phosphorylation thereof. Interestingly, angiopoietin 1 does 
not stimulate the growth of endothelial cells in vitro, as 
does VEGF and fibroblast growth factor, for example. Thus, 
angiopoietin 1 would seem to play a role other than as a 
30 mitogen. The evidence to date favors a role in regulating 
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the assembly of non-endothelial vessel wall components, and 
in controlling the maturation and stabilization of blood 
vessels (See C. Suri et al. Cell, 87, 1171-1180, 1996). 

Angiopoietin 1 has a naturally occurring antagonist, 

5 termed angiopoietin 2, which blocks the ability of 

angiopoietin 1 to bind its receptor. Tie 2. Angiopoietin 2 
appears to be expressed only at sites of vascular remodeling 
in the adult. (Maisonpierre et al. Science, 277, 55-60, 
1997) The other receptor known to be required for normal 

10 angiogenesis is Tiel (See e.g. D. Hanahan, Science, 277, 48, 
1997) . The Tie2 tyrosine kinase enables endothelial cells to 
recruit stromal cells for the encasement of endothelial 
tubes. Tiel, on the other hand, may be involved in the 
control of fluid exchange across capillary boundaries, and 

15 in hemodynamic stress resistance. 

Recently, two additional angiopoietin-related proteins 
were identified in mouse and human, termed angiopoietin 3 
and angiopoietin 4 (See Valenzuela et al . Proc. Nat. Acad. 
Sci. 96, 1904-1909, 1999). These molecules also bind the 

20 Tie2 receptor, angiopoietin 3 as an antagonist, but 
angiopoietin 4 as an agonist. 

While the full complement of molecular players in the 
angiogenic process are yet to be defined, it is clear that 
additional angiopoietin- like ligands and receptors exist. 

25 The present invention relates to an angiopoietin- like gene 
and its encoded protein, as well as to pharmaceutical and 
other uses thereof. 



BRIEF SUMMARY OF THE INVENTION 
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The present invention provides angiopoiet in- related 
nucleic acid molecules and their encoded proteins- The genes 
and proteins of the invention are referred to herein as 
^^Scarface 1." A specific Scarf ace 1 gene sequence is 
5 disclosed herein as SEQ ID NO:l, which originated from a 

human source. The Scarf ace 1 nucleic acids of the invention 
are useful, among other things, as probes to isolate 
paralogous genes from humans and/or orthologous genes from 
other organisms. 

10 The Scarf ace 1 peptides provide multiple utilities, for 

example, in screens to identify compounds that bind said 
peptides, as potential pharmaceutical compounds for 
modulating angiogenesis . Scarf ace 1 peptides are also useful 
as therapeutics to enhance woujid healing, for developing 
15 compounds for inhibiting tumor growth, for treating cancer, 
and for use in a method to amplify stem cells. 

The Scarf ace 1 proteins may further be useful in the 
diagnosis and treatment of certain diseases involving 
endothelial cells and associated TIE receptors, such as 
20 neoplastic diseases involving tujmor angiogenesis, wound 
healing, thromboembolic diseases, atherosclerosis and 
inflammatory diseases. In addition, the mammalian ligand may 
be used to promote the proliferation smd/or differentiation 
of hematopoietic stem cells. 
25 More generally, biologically active mammalian Scarface 

1 may be used to promote the growth, survival, migration, 
and/or differentiation and/or stabilization or 
destabilization of cells expressing TIE receptor. 
Biologically active Scarface 1 may be used for the in vitro 
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10 



15 



20 



maintenance of TIE receptor expressing cells in culture • 
Cells and tissues expressing TIE receptor include, 
for example, cardiac and vascular endothelial cells, lens 
epithelium and heart epicardium and early hematopoietic 
cells. Alternatively, Scarf ace 1 may be used to support 
cells which are engineered to express TIE receptor. Further, 
Scarface 1 and its cognate receptor may be used in assay 
systems to identify agonists or antagonists of TIE receptor. 

The invention further provides a nucleic acid sequence 
encoding Scarface 1, and methods for the generation of 
nucleic acid encoding Scarface 1. 

In one embodiment, the present invention relates to a 
substantially purified Scarface 1 peptide or fragment 
thereof . 

In another embodiment, the present invention relates to 
a substantially purified protein preferably having an amino 
acid sequence that is at least 70% identical to SEQ ID NO: 2, 
said protein having Scarface 1 activity in vivo or in vitro. 

In another embodiment, the present invention relates to 
a substantially purified Scarface 1 peptide from a human 
source wherein said peptide has an amino acid sequence 
designated herein as SEQ ID NO: 2. 

In another embodiment, the present invention relates to 
a substantially purified protein having an amino acid 
sequence that is encoded by a nucleic acid sequence that 
hybridizes to SEQ ID NO:l under high stringency conditions, 
said protein having Scarface 1 activity. 

In another embodiment, the present invention relates to 
isolated nucleic acids encoding an Scarface 1 peptide. 



wo 99/62925 



-6- 



PCT/US99/11933 



In another embodiment, the present invention relates to 
isolated nucleic acids encoding Scarf ace 1, designated 
herein as SEQ ID NO: 2. 

In another embodiment, the present invention relates to 
an isolated nucleic acid sequence encoding Scarface 1 said 
nucleic acid being designated herein as SEQ ID NO:l. 

In still another embodiment the present invention 
relates to a vector comprising an isolated nucleic acid of 
the present invention, wherein said isolated nucleic acid 
compound may be operably- linked to a promoter sequence • 

In another embodiment, the present invention relates to 
a host cell containing a vector of the present invention. 

In yet another embodiment, the present invention 
relates to a pharmaceutical formulation comprising as an 
active ingredient a therapeutically effective amount of an 
Scarface 1 peptide, associated with one or more 
pharmaceutically acceptable carriers, excipients, or 
diluents thereof ♦ 

In a further embodiment, the present invention relates 
to a pharmaceutical formulation comprising an Scarface 1 
antagonist associated with one or more pharmaceutically 
acceptSLble carriers, excipients, or diluents thereof. 

In a further embodiment, the present invention provides 
for an antibody which specifically binds Scarface 1. The 
antibody may be monoclonal or polyclonal. 

The invention further provides for therapeutic 
compositions comprising an antibody which specifically binds 
Scarface 1 in a pharmaceutically acceptable vehicle. 

In a further embodiment, the invention provides for a 
method of blocking blood vessel growth in a mammal by 
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administering an effective amount of a therapeutic 
composition comprising an antibody which specifically binds 
Scarf ace 1 in a pharmaceutically acceptable vehicle. 

The invention also provides for a method of promoting 

5 neovascularization in a patient by administering an 

effective amount of a therapeutic composition comprising 
Scarf ace 1 in a pharmaceutically acceptable vehicle. In one 
embodiment, the method may be used to promote wound healing. 
In another embodiment, the method may be used to treat 

10 ischemia. In yet another embodiment. Scarf ace 1 is used, 

alone or in combination with other hematopoietic factors, to 
promote the proliferation or differentiation of 
hematopoietic stem cells, B cells or megakaryocytic cells. 



15 relates to a method for identifying an antagonist of an 
Scarf ace 1 peptide, wherein said antagonist interferes in 
the binding of said Scar face 1 peptide to its receptor, 
comprising the steps of: mixing an Scarf ace 1 preparation 
with a test sample, or a control sample; monitoring the 

20 binding of said Scarf ace 1 peptide to its receptor, by any 
suitable means; and comparing the level of binding of 
Scarface 1 to its receptor in the test sample to the level 
of said binding in the control san^le. 



25 relates to a method for suppressing tximor growth comprising 
the step of inhibiting the binding interaction in vivo of 
Scarface l to an Scarface 1 receptor by administering to a 
mammal in need thereof an effective amount of an Scarface 1 
antagonist . 



In still another embodiment the present invention 



In yet another embodiment, the present invention 
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In another embodiment, the present invention relates to 
a method for stimulating angiogenesis by administration of 
an Scarf ace 1 peptide. 

In still another embodiment, the present invention 
5 relates to a method for stimulating wound healing by 

administration of a therapeutically effective amoxint of an 
Scarf ace 1 peptide. 

In still smother embodiment, the present invention 
relates to a method to treat ischemia by administration of a 
10 therapeutically effective amount of an Scarf ace 1 peptide. 

In another embodiment, the present invention relates to 
a method of blocking blood vessel growth in a mammal by 
administering an effective amount of a therapeutic 
composition comprising an antibody which specifically binds 
15 Scarf ace 1 in a pharmaceutically acceptable vehicle. 

In still another embodiment, the invention provides for 
a method of blocking blood vessel growth in a mammal by 
administering an effective amount of a therapeutic 
composition comprising a receptorbody which specifically 
20 binds Scarf ace 1 in a pharmaceutically acceptcdDle vehicle. 

In still another embodiment, the present invention 
relates to an antibody that selectively binds to an Scarf ace 
1 peptide. 

In yet another embodiment, the present invention 
25 relates to a method for promoting the proliferation of, or 
expanding stem cells comprising the step of administration 
of an Scarface 1 peptide to a tissue or to a population of 
cells that contain said stem cells. 

In still another embodiment, the present invention 
30 relates to a method for expanding hematopoietic stem cells 
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comprising the step of administration of an Scarface 1 
peptide to a population of bone marrow derived cells. 



DETAILED DESCRIPTION OF THE INVENTION 



5 



Definitions 



The term analog" or functional analog" refers to a 
related modified form of Scarface 1, in which at least one 
amino acid substitution, deletion, or addition has been made 
such that said analog retains substantially the same 
10 biological activity as the xinmodified foann, in vivo and/or 
in vitro. 

The term "angiogenesis" refers generally to the 
formation and differentiation of blood vessels. 



15 antibodies, fragments of antibodies (such as, but not 
limited, to Fab, Fab', Pab2 ' , and Fv fragments), and 
chimeric, humanized, veneered, resurfaced, or CDR-grafted 
antibodies capable of binding antigens of a similar nature 
as the parent antibody molecule from which they are derived. 

20 The term chimeric protein" or « fusion protein" refers 

to a hybrid protein or peptide comprising part or all of a 
Scarface 1 protein or analog thereof linked to a 
heterologous protein. The heterologous protein can be fused 
to the N-terminus or the C-terminus of Scarface 1. A 

25 preferred chimeric protein comprises the fibrinogen- like 

domain of a Scarface 1 protein fused to the constant region 
of an antibody. 



used herein refer to the capacity of purine and pyrimidine 
30 nucleotides to associate through hydrogen bonding to form 



The term "antibody" as used herein describes 



The terms "complementary" or "complementarity" as 
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doxible stranded nucleic acid molecules. The following base 
pairs are related by complementarity: guanine and cytosine; 
adenine and thymine; and adenine and uracil. As used herein, 
« complementary" means that the aforementioned relationship 

5 applies to substantially all base pairs comprising two 
single- stranded nucleic acid molecules over the entire 
length of said molecules. « Partially complementary" refers 
to the aforementioned relationship in which one of two 
single- stranded nucleic acid molecules is shorter in length 

10 than the other such that a portion of one of the molecules 
remains single - stranded • 

The term conservative s\abstitution" or 
^conservative amino acid substitution" refers to a 
replacement of one or more amino acid residue (s) in a parent 

15 protein as stipulated by Table 1. 

The term fibrinogen- like domain" refers to a 
conserved region of amino acids typically found at the 
carboxy end of a protein molecule that contains such a 
domain, said domain being related structurally and/ or by 

20 sequence to the homologous region of a fibrinogen protein 
molecule. For example, the fibrinogen- like domain of 
Scarface 1 resides between the position approximating 
residues 275 through 493 of SEQ ID NO: 2. The term is 
intended to include domains that are at least 65% identical 

25 with residues 275 through 493 of SEQ ID NO: 2; alternatively, 
between 70% to 100% identical; preferably, greater than 85% 
identical; more preferably, at least 90% identical; and most 
preferably, greater than 95% identical. The fibrinogen- like 
domain is thought to be important for the biological 
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activity of Scarf ace 1 and related molecules, for example, 
as the receptor binding site. 

The term « fragment thereof refers to a fragment, 
piece, or sub- region of a nucleic acid or protein molecule 

5 whose sequence is disclosed herein, such that the fragment 
comprises 5 or more amino acids, or 10 or more nucleotides 
that are contiguous in the parent protein or nucleic acid 
molecule. "Fragment thereof" may or may not retain 
biological activity. A fragment of a protein disclosed 

10 herein could be used as an antigen to raise a specific 

antibody against the parent protein molecule. With reference 
to a nucleic acid molecule, fragment thereof" refers to 10 
or more contiguous nucleotides, derived from a parent 
nucleic acid. The term also encompasses the complementary 

15 sequence. For example if the fragment entails the sequence 
5' -AGCTAG-3' , then « fragment thereof" would also include the 
complementary sequence , 3 ' -TCGATC- 5 ' . 

The term "functional fragment" or "functionally 
equivalent fragment", as used herein, refers to a region, or 

20 fragment of a full length protein, or sequence of amino 
acids that, for example, con5)rises an active site, or any 
other motif, relating to biological function. Pxinctional 
fragments are capable of providing a substantially similar 
biological activity as a protein disclosed herein. In vivo 

25 or in vitro, viz. the capacity to modulate angiogenesis, or 
the ability to promote phosphorylation of the Tie2 receptor 
on, for example, epithelial cells. Functional fragments may 
be produced by cloning technology, or as the natural 
products of alternative splicing mechanisms. 
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The term functionally related" is used herein to 
describe peptides that are related to the Scarface 1 
peptides of the present invention, said functionally related 
peptides constituting modifications of said Scarface 1 
5 peptides in which conservative or other suitable amino acid 
changes are present, or are introduced by natural mutation 
or by recombinant DNA techniques. Functionally related 
peptides retain the biological activity of Scarface 1 
peptides, such as the ability to modulate angiogenesis, or 
10 the ability to stimulate phosphorylation of Tie2 (See e.g. 
D. Dumont et al. Oncogene 7, 1471-1480, 1992) . 

The term ~ structurally and functionally related" 
is used herein to describe peptides or proteins that are 
related structurally and functionally to Scarface 1. Said 
15 proteins, generally constitute modifications of SEQ ID NO: 2 
in which conservative or other suitable amino acid changes 
are introduced by natural mutation or by recombinant DNA 
techniques, or by other means, or by deletion of one to ten 
contiguous amino acid residues of SEQ ID NO: 2. Generally, 
20 structurally and functionally related proteins are greater 
than 50% identical with SEQ ID NO:2, alternatively greater 
than 65% identical, preferably between 70% to 100% identical 
with SEQ ID NO:2; still more preferably between 85% to 95% 
identical; more preferably still, at least 90% identical; 
25 most preferably at least 95% identical; still most 

preferably between 98% to 100% identical to SEQ ID NO: 2. The 
overall percent identity between nucleic acid molecules or 
protein molecules is determined by any suitable comparison 
algorithm, well known to the skilled artisan. The overall 
30 percent identity is determined by the number of residues 
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that are identical between optimally aligned nucleic acids 
or proteins, divided by the total length (including gaps if 
present) of the shortest sequence of the pair or group being 
compared . 

5 Percent identity can also refer to, or be 

expressed as, a local comparison of regions, motifs, or 
blocks within molecules. Local identities between molecules 
may be greater than 80% to 90%, while surrounding regions 
may be less conserved, for example, less than 50% identical, 

10 Such variability in sequence conservation along the length 
of molecules can produce a low percentage identity overall, 
but a high percentage identity within locally defined 
regions. Examples of locally conserved regions might include 
binding sites, or catalytic sites. The invention is 

15 therefore intended to include analogs of Scarface 1 that are 
at least 40% identical with the fibrinogen binding region 
comprising residues from about 275 through 493 of SEQ ID 
NO:2. 

Nucleic acid or protein sequences are optimally 
20 aligned so as to achieve the greatest degree of similarity, 
allowing for gaps, using any suitable algorithm, for 
example, a dynamic progrsimming algorithm (See e.g. Smith and 
Waterman, J. Mol . Biol. 147, 195 (1985), BIASTA (Altschul et 
al. J. Mol. Biol. 215, 403 (1990); or FASTA (Lipman & 
25 Pearson, Science, 227, 1435 (1985), herein incorporated by 
reference. Such alignments are carried out with the 
appropriate algorithmic parameters set to maximize the 
alignment score obtained for a pair of sequences being 
compared. For example, using a BLAST algorithm, the Hsp 
30 parameter can be set at from 10 to 11 for comparing nucleic 
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acids and at 3 when comparing amino acid sequences. Other 
parameters that can be varied include the expectation value 
parameter, most typically set to 10. The skilled artiseui is 
aware of the values most appropriate so as to maximize the 

5 alignment between any two or more sequences. Said 

structurally and functionally related proteins retain the 
biological activity of Scarf ace 1, such as the ability to 
modulate angiogenesis, or the ability to stimulate 
phosphorylation of Tie2, or to inhibit solid tumor growth in 

10 vivo, or the ability to suppress the growth of hematopoietic 
progenitor cells (See e.g. D. Dumont et al . Oncogene 7, 
1471-1480, 1992) . 

Thus, the present invention comprises Scarf ace 1, as 
defined by SEQ ID NO: 2, as well as functionally equivalent 

15 variants comprising naturally occurring allelic variations, 
as well as proteins or peptides comprising substitutions, 
deletions or insertional mutants of SEQ ID NO: 2, which bind 
TIE receptor and act as agonists or antagonists thereof . 
Such variants include those in which amino acid residues are 

20 substituted for residues within SEQ ID NO: 2 resulting in a 
silent change. For example, one or more amino acid residues 
within the sequence can be substituted by another amino 
acid(s) of a similar polarity which acts as a functional 
equivalent, resulting in a silent alteration. Substitutes 

25 for an amino acid within the sequence may be selected from 
other members of the class to which the amino acid belongs . 
For example, the class of nonpolar (hydrophobic) amino acids 
include alanine, leucine, isoleucine, valine, proline, 
phenylalanine, tryptophan and methionine. The polar neutral 
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amino acids include glycine, serine, threonine, cysteine, 
tyrosine, asparagine, and glutaitiine. The positively charged 
(basic) cunino acids include arginine, lysine and histidine. 
The negatively charged (acidic) amino acids include aspartic 

5 acid and glutamic acid. Also included within the scope of 
the invention are proteins or fragments or derivatives 
thereof which exhibit the same or similar biological 
activity as the Scarf ace 1 described herein, and derivatives 
which are differentially modified during 

10 or after translation, e.g., by glycosylation, proteolytic 
cleavage, linkage to an antibody molecule or other cellular 
ligand, etc. 

The term ^^host cell" refers to any eucaryotic or 
procaryotic cell that is suitable for propagating and/or 
15 expressing a cloned gene contained on a vector that is 
introduced into said host cell by, for exeunple, 
transformation or transf ection, or the like. 

Specific examples of Scarface 1 genes and/or 
proteins are referred to herein as **Scarface 1" (SEQ ID 
20 NO:l/SEQ ID NO: 2) . Scarf ace 1 molecules are related to 
angiopoietin 1, which mediates angiogenesis during 
embryogenic said later development. 

The term '^homolog*' or ^^homologous" describes the 
relationship between different nucleic acid molecules, or 
25 amino acid sequences, in which said sequences or molecules 
are related by partial identity or chemical /physical 
similarity at one or more blocks or regions within said 
molecules or sequences. 

The term "hybridization" as used herein refers to 
30 a process in which a single- stranded nucleic acid molecule 
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joins with a complementary strand through nucleotide base 
pairing. "Selective hybridization" refers to hybridization 
under conditions of high stringency. The degree of 
hybridization depends upon, for example, the degree of 

5 homology, the stringency of hybridization, and the length of 
hybridizing strands. 

The term "isolated nucleic acid compound" refers 
to any RNA or DNA sequence, however constructed or 
synthesized, which is locationally distinct from its natural 

10 location. 

The term ^"modulate" or ^^modulating" refers to 
stimulating or inhibiting the object thereof, for example, 
stimulating or inhibiting angiogenesis . 

The symbol '^N" in a nucleic acid sequence refers 
15 to adenine ("W), guanine ('"G"), cytosine (^C), thymine 
(T") , or uracil («U") . 

The term ^nucleic acid probe" or "probe" as used 
herein is a labeled nucleic acid compound which hybridizes 
with another nucleic acid compound. *^Nucleic acid probe" 
20 means a single stranded nucleic acid sequence that will 
combine with a complementary or partially complementary 
single stranded target nucleic acid sequence to form a 
doiible- stranded molecule. A nucleic acid probe may be an 
oligonucleotide or a nucleotide polymer. A probe will 
25 usually contain a detectable moiety which may be attached to 
the end(s) of the probe or be internal to the sequence of 
the probe. 

The term ^'orthologue" or "^orthologous" refers to 
two or more genes or proteins from different organisms that 
30 exhibit sequence homology. 
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The term **paralogue" or ^paralogous" refers to two 
or more genes or proteins within a single organism that 
exhibit sequence homology. 

The term ^'peptide" or "protein" are used 
5 interchangeably to mean a biopolymer comprising a linear 
array of two or more peptide bonded amino acids . 

The term "plasmid" refers to an extrachromosomal 
genetic element. The plasmids disclosed herein are 
commercially available, publicly available on an 
10 unrestricted basis, or can be constructed from readily 

available plasmids in accordance with published procedures. 

A "primer" is a nucleic acid fragment which 
functions as an initiating sxibstrate for enzymatic or 
synthetic elongation of, for example, a nucleic acid 
15 molecule. 

The term "promoter" refers to a nucleic acid 
sequence that directs transcription, for exaitple, of DNA to 
RNA. An inducible promoter is one that is regulatable by 
environmental signals, such as carbon source, heat, or metal 

20 ions, for example. A constitutive promoter generally 
operates at a constant level and is not regulatable. 

The term "recombinant DNA cloning vector" as used 
herein refers to any autonomously replicating agent, 
including, but not limited to, plasmids and phages, 

25 comprising a DNA molecule to which one or more additional 
DNA segments can or have been incorporated. 

The term "recombinant DNA expression vector" or 
"expression vector" as used herein refers to any recombinant 
DNA cloning vector, for example a plasmid or phage, in which 

30 a promoter and other regulatory elements are present thereby 
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enabling transcription of an inserted DNA, which may encode 
a protein. 

The term «Scarface 1 activity" or ''Scarface 1-like 
activity" refers to one or more biological activities 
exhibited by Scarf ace 1, for example, the ability to 
modulate angiogenesis, in vivo or any suitable in vitro 
proxy assay, or the ability to stimulate phosphorylation of 
Tie 2, or any other biological effect or activity of 
Scarf ace 1 or a functional region of Scarf ace 1, for 
example, the fibrinogen- like domain. 

The term ^'stem cell" refers generally to an 
unspecialized cell type that gives rise to differentiated 
cells. For example, hematopoietic stem cells in bone marrow 
give rise to cells of the blood, e.g. B-cells and red blood 
cells . 

The term "stringency" refers to hybridization 
conditions. High stringency conditions disfavor non- 
homologous base pairing. Low stringency conditions have the 
opposite effect. Stringency may be altered, for example, by 
temperature and salt concentration. 

The term ""low stringency" refers to conditions 
that comprise, for example, a temperature of about 37° C or 
less, a formamide concentration of less than about 50%, and 
a moderate to low salt (SSC) concentration; or, 
alternatively, a temperature of about 50° C or less, and a 
moderate to high salt (SSPE) concentration, for example IM 
NaCl. 

The term "high stringency" refers to conditions 
that comprise, for example, a temperature of about 42° C or 
less, a formamide concentration of less than about 20%, and 
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a low salt (SSC) concentration; or, alternatively, a 
temperature of about 65° C, or less, and a low salt (SSPE) 
concentration. For example, high stringency conditions 
comprise hybridization in 0 . 5 M NaHP04, 7% sodium dodecyl 

5 sulfate (SDS) , 1 mM EDTA at 65°C (Ausubel, F.M* et al . 

Current Protocols in Molecular Biology , Vol. I, 1989; Green 
Inc . New York, at 2.10.3)- 

The term «SSC" comprises a hybridization and wash 
solution. A stock 2 OX SSC solution contains 3M sodium 

10 chloride, 0.3M sodium citrate, pH 7.0. 

The term ^*SSPE" comprises a hybridization and wash 
solution. A IX SSPE solution contains 180 mM NaCl, 9mM 
NaaHPO^, 0.9 mM NaHaPO^ and 1 mM EDTA, pH 7.4. 

The term « substantially pure," used in reference 

15 to a peptide or protein, means separation from other 

cellular and non-cellular molecules, including other protein 
molecules. A substantially pure preparation would be about 
at least 85% pure; preferably about at least 95% pure. A 
substantially pure" protein can be prepared by a variety of 

20 techniques, well known to the skilled artisan, including, 
for example, the IMAC protein purification method. 

The term "vector" as used herein refers to a 
nucleic acid compound used for introducing exogenous or 
endogenous DNA into host cells. A vector comprises a 

25 nucleotide sequence which may encode one or more protein 
molecules. Plasmids, cosmids, viruses, and bacteriophages, 
in the natural state or which have undergone recombinant 
engineering, are examples of commonly used vectors. 

The various restriction enzymes disclosed and described 

30 herein are commercially available and the manner of use of 
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said enzymes including reaction conditions, cof actors, and 
other requirements for activity are well known to one of 
ordinary skill in the art. Reaction conditions for 
particular enzymes were carried out according to the 
manufacturer ' s recommendation . 

The Scarface 1 genes encode proteins that are related 
to mammalian angiopoietin 1 and to the Drosophila ^Scabrous" 
protein. Species of Scarface 1 DNA sequences are disclosed 
herein by SEQ ID NO:l (Scarface 1) . Those skilled in the art 
will recognize that owing to the degeneracy of the genetic 
code, numerous "silent" substitutions of nucleotide base 
pairs could be introduced into the sequences identified 
herein without altering the identity of the encoded amino 
acid(s) or protein or peptide product. All such 
suLbstitutions are intended to be within the scope of the 
invention. 

Specific examples of a Scarface 1 cDNA and protein are 
identified herein as SEQ ID NO:l and SEQ ID NO:2, 
respectively. Scarface 1 (SEQ ID NO: 2) comprises multiple 
functional domains including a fibrinogen- like domain from 
approximately about residue 275 through 493, and a coiled- 
coil domain at the amino terminal end from approximately 
residues 1 through 274. 

Also contemplated by the present invention are peptides 
that are functionally related to Scarface 1. For example, 
peptides that are functionally related to SEQ ID NO: 2 may be 
produced by conservative amino acid substitutions, 
replacements, deletions, or insertions, at one or more amino 
acid positions within Scarface 1, in accordance with the 
Table presented herein. 
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Modifications of Scarface 1 peptides made in accordance 
with the Table are generally expected to retain the 
biological activity of the parent molecule based on the art 
recognized substitutability of certain amino acids {See e.g. 
5 M. Dayhoff, In Atlas of Protein Sequence and Structure ^ Vol. 
5, Supp. 3, pgs 345-352, 1978) . Functionality is easily 
tested in an assay that measures phosphorylation of the Tie2 
receptor, for exanple. 
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ORIGINAL RESIDUE 


EXEMPLARY 


SUBSTITUTIONS 
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THR 




ARG 


liYS 
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HIS, 


SER 




ASP 




ASN 
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SER 






GLN 


ASN, 


HIS 




GLU 


ASP, 


GLU 




GLY 
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SER 
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LEU, 


VAL, 
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ILE, 


VAL 




LYS 
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GLN, 


OT.TT TTTP 
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MET 


LEU, 
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PHE 


LEU, 


TYR 
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THR, 


ALA, 


ASN 


THR 


SER, 


ALA 




TRP 


ARG, 


SBR 




TYR 


PHE 






VAL 


ILE, 


LEU, 


ALA 


PRO 


ALA 







Fragments of Scarface 1 proteins 

One embodiment of the instant invention provides 
fragments of the peptides disclosed that may or may not be 
5 biologically active. Such fragments are useful, for example, 
as an antigen for producing an antibody to said proteins. 

Fragments of the proteins disclosed herein may be 
generated by any number of suitable techniques, including 
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chemical synthesis of any portion of SEQ ID NO: 2, 
proteolytic digestion of said peptides, or most preferably, 
by recombinant DNA mutagenesis techniques, well known to the 
skilled artisan. See. e.g. K. Struhl, ^^Reverse biochemistry: 

5 Methods and applications for synthesizing yeast proteins in 
vitro," Meth. Enzymol, 194, 520-535. For example, in a 
preferred method, a nested set of deletion mutations are 
introduced into a gene or gene fragment (e.g, SEQ ID NO:l) 
encoding an Scarface 1 peptide such that varying amounts of 

10 the peptide coding region are deleted, either from the amino 
terminal end, or from the carboxyl end of the protein 
molecule. This method can also be used to create internal 
fragments of the intact protein in which both the carboxyl 
and amino terminal ends are removed. Several appropriate 

15 nucleases can be used to create such deletions, for example 
BaI31, or in the case of a single stranded nucleic acid 
molecule, mung bean nuclease. For simplicity, it is 
preferred that the Scarface 1 gene be cloned into a single- 
stranded cloning vector, such as bacteriophage M13, or 

20 equivalent. If desired, the resulting deletion fragments can 
be subcloned into any suitable vector for propagation and 
expression in any suitable host cell. 



invention may be produced as described above, preferably 
25 using cloning techniques to engineer smaller versions of the 
intact gene, lacking sequence from the 5' end, the 3' end, 
from both ends, or from an internal site. Smaller fragments 
of the genes of this invention may be used to produce the 
encoded peptides, which may be tested for biological 



Functional fragments of the proteins of this 
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activity using any suitable assay, for example, the ability 
of a protein fragment to stimulate phosphorylation of Tie2. 

Gene Isolation Procedures 



Scarface 1 genes could be obtained by a plurality of 
recombinant DNA techniques including, for example, 
hybridization, polymerase chain reaction (PGR) 
amplification, or de novo DNA synthesis. (See e.g., T. 

10 Maniatis et al . Molecular Cloning: A Laboratory Manual , 2d 
Ed. Chap. 18 (1989) ) . 

Methods for constructing cDNA libraries in a suitable 
vector such as a plasmid or phage for propagation in 
procaryotic or eucaryotic cells are well known to those 

15 skilled in the art. [See e.g. Maniatis et al. Supra]. 
Suitable cloning vectors are well known and are widely 
available . 

The Scarface 1 genes, or fragments thereof, can be 
isolated from any tissue in which said genes are expressed. 

20 For excunple. Northern analysis reveals that Scarface 1 is 

expressed strongly in muscle, bone marrow, spleen, and lymph 
node, and at lower levels in thymus and fetal liver. 

In one method for gene isolation, mRNA is isolated from 
a suitable tissue, and first strand cDNA synthesis is 

25 carried out. Second round DNA synthesis can be carried out 
for the production of the second strand. If desired, the 
double -stranded cDNA can be cloned into any suitable vector, 
for example, a plasmid, thereby forming a cDNA library. 
Oligonucleotide primers targeted to any suitable region of 

30 the sequences disclosed herein can be used for PGR 



5 



Those skilled in the art will recognize that the 
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amplification of Scarf ace 1 genes. See e.g. PGR Protocols; A 
Guide to Method and Application , Ed. M. Innis et al.. 
Academic Press (1990) . The PGR amplification comprises 
template DNA, suitable enzymes, primers, and buffers, and is 
5 conveniently carried out in a DNA Thermal Cycler (Perkin 
Elmer Cetus, Norwalk, GT) . A positive result is determined 
by detecting an appropriately- sized DNA fragment following 
agarose gel electrophoresis. 



10 Protein Production Methods 

One embodiment of the present invention relates to 
substantially purified proteins encoded by Scarf ace 1 genes. 

Skilled artisans will recognize that the proteins 
of the present invention can be synthesized by a number of 

15 different methods, such as chemical methods well known in 
the art, including solid phase peptide synthesis or 
recombinant methods. Both methods are described in U.S. 
Patent 4,617,189, incorporated herein by reference. 

The principles of solid phase chemical synthesis 

20 of polypeptides are well known in the art and may be found 
in general texts in the area. See, e.g., H. Dugas and G. 
Penney, Bioorganic Ghemistry (1981) Springer-Verlag, New 
York, 54-92. For example, peptides may be synthesized by 
solid-phase methodology utilizing an Applied Biosystems 43 OA 

25 peptide synthesizer (Applied Biosystems, Poster City, CA) 
and synthesis cycles supplied by Applied Biosystems. 

The proteins of the present invention can also be 
produced by recombinant DNA methods using a cloned Scarf ace 
1 gene as template. Recombinant methods are preferred if a 

30 high yield is desired. Expression of an Scarf ace 1 gene can 
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be carried out in a variety of suitable host cells, well 
known to those skilled in the art. For example, SEQ ID NO:l 
is introduced into a host cell by any suitable means, well 
known to those skilled in the art. While chromosomal 

5 integration of the cloned gene is within the scope of the 
present invention, it is preferred that the gene be cloned 
into a suitable extra- chromosomal ly maintained e3q>ression 
vector so that the coding region of the Scarface 1 gene is 
operably- linked to a constitutive or inducible promoter. 

10 The basic steps in the recombinant production of 

an Scarface 1 protein are : 

a) constructing a natural, synthetic or 
semi- synthetic DNA encoding an Scarface 1 
protein; 

15 

b) integrating said DNA into an e3q>ression 
vector in a manner suitable for expressing 
the Scarface 1 protein, either alone or as a 
fusion protein; 

20 

c) transforming or otherwise introducing 
said vector into an appropriate eucaryotic or 
prokaryotic host cell forming a recombinant 
host cell, 

25 

d) culturing said recombinant host cell in 
a manner to express the Scarface 1 protein; 
and 
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e) recovering and substantially purifying 
the Scarf ace 1 protein by any suitable means, 
well knovm to those skilled in the art. 
In addition, the invention further contemplates compositions 

5 comprising the receptor binding domain of Scarf ace 1 . For 
example. Scarf ace 1 consists of a "coiled coil" domain 
(beginning at the amino terminal end and extending to about 
position 274 of SEQ ID NO: 2) and a fibrinogen- like domain 
(beginning at about position 275 and extending through the C 

10 terminal end) . Mult imerizat ion of the coiled- coil domains 
during production of the protein hampers purification. The 
fibrinogen- like domain is likely to comprise the TIE- 2 
receptor binding domain. The monomeric forms of the 
fibrinogen- like domain do not, however, appear to 

15 bind the receptor. Methods of production of "clustered 
ligands and ligandbodies are described in Davis, et al. 
Science 266:816-819 (1994) . Based on these findings the 
invention further relates to "ligandbodies, " or Scarf ace 1 
fusion proteins, which comprise the fibrinogen- like domain 

20 of Scar face 1 coupled to the Fc domain of IgG, for example. 
These fusion proteins form dimers that bind the TIE- 2 
receptor. Accordingly, the present invention contemplates 
the production of Scarface 1 fusion proteins which may be 
used as targeting agents, in diagnostics or in therapeutic 

25 applications, such as targeting agents for tximors and/or 
associated vasculature wherein a TIE antagonist is 
indicated. 

The invention herein further provides for the 
development of the Scarface 1 ligand, a fragment or 
30 derivative thereof, or another molecule which is a receptor 
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agonist or antagonist, as a therapeutic for the treatment of 
patients suffering from disorders involving cells, tissues 
or organs which express the TIE receptor. Such molecules may 
be used in a method of treatment of the human or animal 

5 body, or in a method of diagnosis. 

Because TIE receptor has been identified in association 
with endothelial cells, applicants expect that Scarf ace 1 
may be useful for the induction of vascularization in 
diseases or disorders where such vascularization is 

10 indicated. Such diseases or disorders would include wound 
healing, ischaemia and diabetes. The ligands may be tested 
in animal models and used therapeutically as described for 
other agents, such as vascular endothelial growth factor 
(VEGF) , another endothelial cell- specific factor that is 

1 5 angi ogeni c . 

For example, various in vivo and/or in vitro studies 
are useful to demonstrate the effect of an angiogenic factor 
in enhancing blood flow to ischemic myocardium, enhancing 
wound healing, and in other therapeutic settings wherein 

20 neoangiogenesis is desired. See Ferrara, et al. U.S. Pat. 
No. 5,332,671; European Patent Application 0 550 296 A2; 
Banai, et al. Circulation 89, 2183-2189 (1994); Unger, et 
al. Am. J. Physiol. 266, H1588-H1595 (1994); Lazarous, et 
al. Circulation 91, 145-153 (1995), all of which are herein 

25 incorporated by reference. 

According to the invention. Scar face 1 may be used 
alone or in combination with one or more additional 
pharmaceutical ly active compounds such as, for example, VEGF 
or basic fibroblast growth factor (bFGF) , as well as 

30 cytokines, neurotrophins , etc. 
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Expressing Recombinant Scarface 1 Protein in Procaryotic ctnd 
Eucaryotic Host Cells 

Procaryotes may be employed in the production of 

5 recombinant Scarface 1 protein. For example, the 
Escherichia, coli K12 strain 294 (ATCC No. 31846) is 
particularly useful for the prokaryotic expression of 
foreign proteins. Other strains of J5. coli, bacilli such as 
Bacillus sxibtilis, enterobacteriaceae such as Salmonella 

10 typhimurium or Serratia marcescens, various Pseudomonas 

species and other bacteria, such as Streptomyces, may also 
be employed as host cells in the cloning and expression of 
the recombinant proteins of this invention. 

Promoter sequences suitable for driving the 

15 egression of genes in procaryotes include b -lactamase 

[e.g. vector pGX2907, ATCC 39344, contains a replicon and b 
-lactamase gene], lactose systems [Chang et al., Nature 
(London), 275:615 (1978); Goeddel et al . , Nature (London), 
281:544 (1979)], alkaline phosphatase, and the tryptophan 

20 (trp) promoter system [vector pATHl (ATCC 37695)1, which is 
designed to facilitate expression of an open reading frame 
as a trpE fusion protein xinder the control of the trp 
promoter. Hybrid promoters such as the tac promoter 
(isolatable from plasmid pDR540, ATCC- 3 72 82) are also 

25 suitable. Still other bacterial promoters, whose nucleotide 
sequences are generally known, may be ligated to DNA 
encoding the protein of the instant invention, using linkers 
or adapters to supply any required restriction sites* 
Promoters for use in bacterial systems also will contain a 

30 Shine -Dalgarno sequence operably- linked to the DNA encoding 
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the desired polypeptides. These exair^les are illustrative 
rather than limiting. 

The proteins of this invention may be synthesized 
either by direct expression or as a fusion protein 
5 comprising the protein of interest as a translational fusion 
with another protein or peptide which may be removable by 
enzymatic or chemical cleavage. It is often obseirved in the 
production of certain peptides in recombinant systems that 
expression as a fusion protein prolongs the lifespan, 
10 increases the yield of the desired peptide, or provides a 
convenient means of purifying the protein. This is 
particularly relevant when expressing mammalicin proteins in 
procaryotic hosts. A variety of peptidases (e.g. 
enterokinase and thrombin) which cleave a polypeptide at 
15 specific sites or digest the peptides from the amino or 

carboxy termini (e.g. diaminopeptidase) of the peptide chain 
are known. Furthermore, particular chemicals (e.g. cyanogen 
bromide) will cleave a polypeptide chain at specific sites. 
The skilled artisan will appreciate the modifications 
20 necessary to the amino acid sequence (and synthetic or semi- 
synthetic coding sequence if recombinant means are employed) 
to incorporate site-specific internal cleavage sites. See 
e.g., P. Carter, "Site Specific Proteolysis of Fusion 
Proteins", Chapter 13, in Protein Purification: From 
25 Molecular Mechanisms to Large Scale Processes , American 
Chemical Society, Washington, D.C. (1990) . 

One useful Scarface 1 fusion protein is a GST 
fusion protein in which a Scarface 1 protein of the 
invention is fused to the C-terminus of GST sequences. Such 
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fusion proteins are useful, for example, for purification of 
a recombinant polypeptide of the invention. 

In another embodiment of the invention, the fusion 
partner comprises an immunoglobulin peptide or protein in 

5 which part or all of a Scarf ace 1 protein or analog, for 
example, the fibrinogen- like domain, is fused to the 
constant region of an IgG molecule. Such fusion proteins can 
facilitate purification and may be useful in pharmaceutical 
compositions or in screens for compoiinds that modulate the 

10 interaction of Scarf ace 1 with its receptor. 

In addition to procaryotes, a variety of amphibian 
expression systems such as frog oocytes, and mammalian cell 
systems can be used. The choice of a particular host cell 
depends to some extent on the particular expression vector 

15 used. Exemplary mammalian host cells suitable for use in 
the present invention include HepG-2 (ATCC HB 8065) , CV-1 
(ATCC CCL 70), LC-MK2 (ATCC CCL 7.1), 3T3 (ATCC CCL 92), 
CHO-Kl (ATCC CCL 61) , HeLa (ATCC CCL 2) , RPMI8226 (ATCC CCL 
155), H4IIEC3 (ATCC CCL 1600), C127I (ATCC CCL 1616), HS- 

20 Sultan (ATCC CCL 1884), and BHK-21 (ATCC CCL 10), for 
example . 

A wide variety of vectors are suitable for 
transforming mammalian host cells. For exsunple, the pSV2- 
type vectors comprise segments of the simian virus 40 (SV40) 

25 genome required for transcription and polyadenylation. A 
large n\imber of plasmid pSV2-type vectors have been 
constructed, such as pSV2-gpt, pSV2-neo, pSV2-dhfr, pSV2- 
hyg, and pSV2-b-globin, in which the SV40 promoter drives 
transcription of an inserted gene. These vectors are widely 

30 available from sources such as the American Type Culture 
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Collection (ATCC) , 12301 Parklawn Drive, Rockville, 
Maryland, 20852, or the Northern Regional Research 
Laboratory (NRRL) , 1815 N. University Street, Peoria, 
Illinois, 61604. 

5 Promoters suitable for expression in mammalian 

cells include the SV40 late promoter, promoters from 
eukaryotic genes, such as, for example, the estrogen- 
inducible chicken ovalbximin gene, the interferon genes, the 
glucocorticoid- inducible tyrosine aminotransferase gene, the 

10 thymidine kinase gene promoter, and the promoters of the 
major early and late adenovirus genes. 

Plasmid pRSVcat (ATCC 37152) comprises portions of 
a long terminal repeat of the Rous Sarcoma virus, a virus 
known to infect chickens and other host cells. This long 

15 terminal repeat contains a promoter which is suitable for 
use in the vectors of this invention. H. Gorman et al., 
Proc. Nat. Acad. Sci. (USA), 79, 6777 (1982). The plasmid 
pMSVi (NRRIi B- 15929) comprises the long terminal repeats of 
the Murine Sarcoma virus, a virus known to infect mouse and 

20 other host cells. The mouse metallothionein promoter has 
also been well characterized for use in eukaryotic host 
cells and is suitable for use in the present invention. 
This promoter is present in the plasmid pdBPV-MMTneo (ATCC 
37224) which can serve as the starting material for the 

25 construction of other plasmids of the present invention. 

Transfection of mammalian cells with vectors can 
be performed by a plurality of well known processes 
including, but not limited to, protoplast fusion, calciiam 
phosphate co-precipitation, electroporation and the like. 

30 See, e.g., Maniatis et al., supra. 
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Some viruses also make appropriate vectors. 
Examples include the adenoviruses, the adeno-associated 
viruses, the vaccinia virus, the herpes viruses, the 
baculoviruses, and the rous sarcoma virus, as described in 

5 U.S. Patent 4,775,624, incorporated herein by reference. 

Eucaryotic microorganisms such as yeast and other 
fungi are also suitable host cells. The yeast Seiccharomyces 
cerevislae is the preferred eucaryotic microorganism. Other 
yeasts such as Kluyveromyces lactis and Pichia pastoris are 

10 also suitable. For expression in Saccharomyces , the plasmid 
YRp7 (ATCC-40053) , for example, may be used. 5ee, e.gr., L. 
Stinchcomb et al . , Nature, 282, 39 (1979); J. Kingsman et 
al.. Gene, 7, 181 (1979); S. Tschemper et al.. Gene, 10, 157 
(1980) . Plasmid YRp7 contains the TRPl gene which provides 

15 a selectable marker for use in a trpl auxotrophic mutant. 

Purification of Recombinantly- Produced Scarf ace 1 Proteins 

An expression vector carrying a cloned Scarface 1 
gene or fragment thereof (e.g. SEQ ID N0:1) is transformed 

20 or transfected into a suitable host cell using standard 

methods. Cells that contain the vector are propagated under 
conditions suitable for expression of a recombinant Scarface 
1 protein. For Example, if the recombinant gene or fragment 
thereof has been placed xinder the control of an inducible 

25 promoter, suitable growth conditions would incorporate the 
appropriate inducer. The recombinantly-produced protein may 
be purified from cellular extracts of transformed cells by 
any suitable means. 

In a preferred process for protein purification, 

30 an Scarface 1 gene or fragment thereof is modified at the 5 ' 



• 
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end to incorporate several histidine codons. This 
modification produces an "histidine tag" at the amino 
terminus of the encoded protein, that enables a single- step 
protein purification method [i.e. "immobilized metal ion 

5 affinity chromatography" (IMAC)], essentially as described 
in U.S. Patent 4,569,794, which hereby is incorporated by 
reference. The IMAC method enables rapid isolation of 
s\ibstantially pure recombinant Scarface 1 protein starting 
from a crude extract of cells that express a modified 

10 recombinant protein, as described above. 
Production of Antibodies 



thereof may be used in the production of antibodies that 
selectively bind Scarface 1. The instant invention also 
15 encompasses single -chain polypeptide binding molecules. 



polyclonal, in animals, especially mice, is well known in 
the art. See, e.g., C. Milstein, Handbook of Experimental 
Immunology , (Blackwell Scientific Pub., 1986); J. Coding, 

20 Monoclonal Antibodies; Principles and Practice , (Academic 
Press, 1983) . For the production of monoclonal antibodies 
the basic process begins with injecting a mouse, or other 
suitable animal, with an immxinogen. The mouse is 
subsequently sacrificed and cells taken from its spleen are 

25 fused with myeloma cells, resulting in a hybridoma that 
reproduces in vitro. The population of hybridomas is 
screened to isolate individual clones, each of which 
secretes a single antibody species, specific for the 
immunogen. Each antibody obtained in this way is the clonal 

30 product of a single B cell. 



The proteins of this invention and fragments 



The production of antibodies, both monoclonal and 



• 
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Chimeric antibodies are described in U.S. Patent 



No. 4,816,567, the entire contents of which is herein 
incorporated by reference. This reference discloses methods 
and vectors for the preparation of chimeric antibodies . An 

5 alternative approach is provided in U.S. Patent No. 
4,816,397, the entire contents of which is herein 
incorporated by reference. This patent teaches co- 
expression of the heavy and light chains of an antibody in 
the same host cell. 

10 The approach of U.S. Patent 4,816,397 has been 

further refined in European Patent Publication No. 0 239 
400. The teachings of this European patent publication are a 
preferred format for genetic engineering of monoclonal 
antibodies. In this technology the complementarity 

15 determining regions (CDRs) of a human antibody are replaced 
with the CDRs of a murine monoclonal antibody, thereby 
converting the specificity of the human antibody to the 
specificity of a murine antibody. 



20 yet another variety of genetically engineered antibody 

technology that is well known in the art. (See, e.g. R.E. 
Bird, et al.. Science 242:423-426 (1988); PCT Pviblication 
Nos. WO 88/01649, WO 90/18430, and WO 91/10737. Single chain 
antibody technology involves covalently joining the binding 

25 regions of heavy and light chains to generate a single 
polypeptide chain. The binding specificity of the intact 
antibody molecule is thereby reproduced on a single 
polypeptide chain. 



Single chain antibodies and libraries thereof are 



• 
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The antibodies contenrplated by this invention are 
useful in diagnostics, therapeutics or in 
diagnostic/therapeutic combinations . 

The Scarface 1 proteins or analogs of this invention or 

5 suitable fragments thereof can be used to generate 

polyclonal or monoclonal antibodies ^ and various inter- 
species hybrids, or humanized antibodies, or antibody 
fragments, or single-chain antibodies. The techniques for 
producing antibodies are well knovm to skilled artisans. 

10 {See e.g. A.M. Campbell, Monoclonal Antibody Technology; 

Laboratory Techniques in Biochemsitry and Molecular Biology , 
Elsevier Science Publishers, Amsterdam (1984) ; Kohler and 
Milstein, Nature 256, 495-497 (1975); Monoclonal Antibodies; 
Principles & Applications Ed. J. R. Birch & E.S. Lennox, 

15 Wiley-Liss , 1995 . 

A protein used as an immunogen may be modified or 
administered in an adjuvant, by subcutaneous or 
intraperitoneal injection into, for example, a mouse or a 
rabbit. For the production of monoclonal antibodies, spleen 

20 cells from immunized animals are removed, fused with myeloma 
cells, such as SP2/0-Agl8 cells, and allowed to become 
monoclonal antibody producing hybridoma cells in the manner 
known to the skilled artisan. Hybridomas that secrete a 
desired antibody molecule can be screened by a variety of 

25 well known methods, for example ELISA assay, western blot 
analysis, or radioimmunoassay (Lutz et al. Exp. Cell Res. 
175, 109-124 (1988); Monoclonal Antibodies; Principles & 
Applications Ed. J. R. Birch & E.S. Lennox, Wiley-Liss, 1995) . 



30 Procedures for Icd^eling antibody molecules are widely known. 



For some applications labeled antibodies are desirable. 
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including for example, the use of radioisotopes, affinity 
labels, such as biotin or avidin, enzymatic labels, for 
example horseradish peroxidase, and fluorescent labels, such 
as FITC or rhodamine {See e.g. Enzyme -Mediated Immunoas say , 

5 Ed. T. Ngo, H, Lenhoff, Plenxim Press 1985; Principles of 
Immunology and Immunodiagnostics , R.M. Aloisi, Lea & 
Febiger, 1988) . 

Labeled antibodies are useful for a variety of 
diagnostic applications. In one embodiment the present 

10 invention relates to the use of labeled antibodies to detect 
the presence of Scarf ace 1 proteins. Alternatively, the 
antibodies could be used in a screen to identify potential 
modulators of Scar face 1. For example, in a competitive 
displacement assay, the smtibody or compound to be tested is 

15 labeled by any suitable method. Competitive displacement of 
an antibody from an antibody- antigen complex by a test 
compound such that a test compound- antigen complex is formed 
provides a method for identifying compounds that bind 
Scarf ace 1 . 

20 other embodiments of the present invention 

comprise isolated nucleic acid sequences that encode the 
proteins of the invention, for example, SEQ ID NO: 2. Also 
contemplated are related nucleic acids that hybridize to SEQ 
ID N0:1 under high stringency conditions. Such sequences may 

25 come, for example, from paralogous or orthologous genes. 

The Scarf ace 1 genes of the invention (e.g. SEQ ID 
NO:l) cuid related nucleic acid molecules may be produced by 
chemical synthetic methods. The synthesis of nucleic acids 
is well known in the art. See, e.g., E.L. Brown, R. 

30 Belagaje, M.J. Ryan, and H.G. Khorana, Methods In 
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Enzymology, 68:109-151 (1979) • Nucleic acids of this 
invention, including those disclosed herein, could be 
generated using a conventional DNA synthesizing apparatus, 
such as the Applied Biosystems Model 380A or 3 SOB DNA 

5 synthesizers (Applied Biosystems, Inc., 850 Lincoln Center 
Drive, Foster City, CA 94404) using phosphoramidite 
chemistry, thereafter ligating the fragments so as to 
reconstitute the entire gene. Alternatively, 
phosphotriester chemistry may be employed to synthesize the 

10 nucleic acids of this invention. (See, e.g., M.J. Gait, 
ed . , Oligonucleotide Synthesis, A Practical Approach , 
(1984) ) . 

In an alternative methodology, namely PGR, the 
DNA sequences disclosed and described herein, can be 

15 produced from a plurality of starting materials. For 
example, starting with a cDNA preparation (e.g. cDNA 
library) derived from a tissue that expresses an Scarface 1 
gene, suitable oligonucleotide primers complementary to, for 
example, SEQ ID NO:l, or to a sub-region therein, are 

20 prepared as described in U.S. Patent No. 4,889,818, hereby 
incorporated by reference. Other suitable protocols for the 
PCR are disclosed in PCR Protocols; A Guide to Method and 
Applications , Ed. Michael A. Innis et al.. Academic Press, 
Inc. (1990) . Using PCR, any region of the Scarface 1 gene(s) 

25 can be targeted for amplification such that full or partial 
length gene sequences may be produced. 

The ribonucleic acids of the present invention may 
be prepared using polynucleotide synthetic methods discussed 
supra, or they may be prepared enzymatically, for example. 
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using RNA polymerase to treuiscribe a Scarface 1 DNA 
template • 

The most preferred systems for preparing the 
ribonucleic acids of the present invention employ the RNA 
5 polymerase from the bacteriophage T7 or the bacteriophage 
SP6. These RNA polymerases are highly specific, requiring 
the insertion of bacteriophage -specific sequences at the 5' 
end of the template to be transcribed. See, Maniatis et al., 
supra . 

10 This invention also provides nucleic acids, RNA or 

DNA, that are complementary to Scarface 1 genes or fragments 
thereof, for example, SEQ ID NO:l. 
Nucleic Acid Probes 

The present invention also provides probes and 

15 primers useful, for example, in hybridization screens of 
genomic, subgenomic, or cDNA libraries, as well as 
hybridization against nucleic acids derived from cell lines 
or tissues. Such hybridization screens are useful as methods 
to identify and isolate full length, or identical, 

20 homologous or functionally related sequences from hxaman or 
other organisms. 

In one embodiment the present invention relates to 
the use of a nucleic acid disclosed herein as a probe to 
identify and isolate full-length genes comprising said 

25 nucleic acids. A nucleic acid compound comprising SEQ ID 
N0:1, or a complementary sequence thereof, or a fragment 
thereof, which is at least 18 base pairs in length, and 
which will selectively hybridize to DNA or mRNA encoding 
Scarface 1 protein or fragment thereof, or a functionally 

30 related protein, is provided. Preferably, the 18 or more 
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base pair compound is DNA. See e.g. B. Wallace and G. 
Miyada, "Oligonucleotide Probes for the Screening of 
Recombinant DNA Libraries," In Meth. Enzym. , 152, 432-442, 
Academic Press (1987) . 

Probes and primers can be prepared by enzymatic or 
recombinant methods, well known to those skilled in the art 
(See e.gr. Sambrook et al. supra) • A probe may be a single 
stranded nucleic acid sequence which is complementary in 
some particular degree to a nucleic acid sequence sought to 
be detected ("target sequence") . A probe may be labeled with 
a detectable moiety such as a radio- isotope, antigen, or 
chemiluminescent moiety. A description of the use of nucleic 
acid hybridization as a procedure for the detection of 
particular nucleic acid sequences is described in U.S. 
Patent No. 4,851,330 to Kohne, entitled "Method for 
Detection, Identification and Quantitation of Non-Viral 
Organisms . " 

Having the DNA sequence of the present invention allows 
preparation of relatively short DNA (or RNA) sequences 
having the ability to specifically hybridize to nucleic 
acids that are homologous or identical to sequences 
disclosed herein. In these aspects, nucleic acid probes of 
an appropriate length are prepared. The ability of such 
nucleic acid probes to specifically hybridize to a 
polynucleotide encoding a Scarf ace 1 gene or related 
sequence lends particular utility in a variety of 
embodiments. Most importantly, the probes may be used in a 
variety of assays for detecting the presence of 
complementary sequences or related sequences in a given 
sample . 
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In certain embodiments, it is advantageous to use 
oligonucleotide primers. The sequence of such primers is 
designed using a polynucleotide of the present 
invention for use in detecting, amplifying or mutating a 

5 defined segment of a gene or polynucleotide that encodes an 
Scarf ace 1 polypeptide, using PGR technology. 

Preferred nucleic acid sequences employed for 
hybridization studies, or assays, include probe molecules 
that are complementary to at least an about 

10 18 to an about 70-nucleotide long stretch of a 

polynucleotide that encodes an Scarf ace 1 polypeptide, e.g. 
SEQ ID NO:l. Molecules having complementary sequences over 
stretches greater than 18 bases in length are generally 
preferred, in order to increase stability and selectivity of 

15 the hybrid. One will generally prefer to design nucleic acid 
molecules having gene- complementary stretches of 25 to 40 
nucleotides, 55 to 70 nucleotides, or even longer where 
desired. Such fragments may be readily prepared by, for 
example, directly synthesizing the fragment by chemical 

20 means, by application of nucleic acid reproduction 

technology, such as the PGR TM technology of U.S. Pat. No. 
4,603,102, herein incorporated by reference, or by excising 
selected DNA fragments from recombinant plasmids containing 
appropriate inserts and suitable restriction enzyme sites. 

25 The following guidelines are useful for designing 

probes with desirable characteristics for detecting Scarface 
1 homologs. The extent and specificity of hybridization 
reactions are affected by a number of factors that determine 
the sensitivity and specificity of a particular probe, 

30 whether perfectly complementary to its target or not. The 
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affect of various experimental parameters and conditions are 
well known to those skilled in the art . 

First, the stability of the probe: target nucleic acid 
hybrid should be chosen to be compatible with the assay 
5 conditions. This may be accomplished by avoiding long A and 
T rich sequences, by terminating the hybrids with G:C base 
pairs and by designing a probe with an appropriate Tm (i.e. 
melting temperature) . The melting profile, including the Tm 
of a hybrid comprising an oligonucleotide and target 
10 sequence, may be determined using a Hybridization Protection 
Assay. The probe should be chosen so that the length and % 
GC content result in a Tm about a^'-lO^ C higher than the 
temperature at which the final assay will be performed. The 
base composition of the probe is also a significant factor 
15 because G-C base pairs exhibit greater thermal stability as 
compared to A-T base pairs. Thus, hybridization involving 
complementary nucleic acids of higher G-C content will be 
more stable at higher temperatures. 

The ionic strength and incubation temperature iinder 
20 which a probe will be used should also be taken into 

account. It is known that hybridization will increase as the 
ionic strength of the reaction mixture increases, and that 
the thermal stability of molecular hybrids will increase 
with increasing ionic strength. On the other hand, chemical 
25 reagents such as formamide, urea, DMSO and alcohols, which 
disrupt hydrogen bonds, increase the stringency of 
hybridization. Destabilization of hydrogen bonds by such 
reagents can greatly reduce the Tm. In general, optimal 
hybridization for synthetic oligonucleotide probes of about 
30 10-50 bases in length occurs approximately 5"* C below the 
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melting temperature for a given duplex. Incubation at 
temperatures below the optimiun may allow mismatched base 
sequences to hybridize and can therefore result in reduced 
specificity. 

5 The length of the target nucleic acid sequence and, 

accordingly, the length of the probe sequence can also be 
important. In some cases, there may be several sequences 
from a particular region, varying in location and length, 
which will yield probes with the desired hybridization 

10 characteristics. In other cases, one sequence may be 
significantly better than another even though the one 
secjuence differs merely by a single base. Finally, there can 
be intramolecular and intermolecular hybrids formed within a 
probe if there is sufficient self -complementarity. Such 

15 structures can be avoided through careful probe design. 

Conqputer programs are available to search for this type of 
interaction . 

A probe molecule may be used for hybridizing to a 
sample suspected of possessing a Scarface 1 or Scarface 1 
20 related nucleotide sequence. The hybridization reaction is 
carried out under suitable conditions of stringency, e.g. 
high stringency if searching for a closely related sequence, 
or low stringency if searching for a more distantly related 
sequence . 

25 Alternatively, such DNA molecules may be used in a 

number of techniques including their use as: (1) diagnostic 
tools to detect polymorphisms in DNA samples from a human or 
other mammal; (2) means for detecting and isolating homologs 
of Scarface 1 and related polypeptides from a DNA library 

30 potentially containing such sequences; (3) primers for 
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hybridizing to related sequences for the purpose of 
amplifying those sequences; and (4) primers for altering the 
native Scarf ace 1 DNA sequences; as well as other techniques 
which rely on the similarity of the DNA sequences to those 

5 of the Scarf ace 1 DNA segments herein disclosed. 

Once synthesized, oligonucleotide probes may be labeled 
by any of several well known methods. See e.g. Maniatis 
et.al.. Molecular Cloning (2d ed. 1989). Useful labels 
include radioisotopes, as well as non- radioactive reporting 

10 groups. Isotopic labels include H\ S^% I"^, Cobalt, and 

C". Most methods of isotopic labeling involve the use of 
enzymes and include methods such as nick- translation, end- 
labeling, second strand synthesis, and reverse 
transcription. When using radio-labeled probes, 

15 hybridization can be detected by autoradiography, 

scintillation counting, or gamma counting. The detection 
method selected will depend upon the hybridization 
conditions and the particular radio isotope used for 
labeling, 

20 Non- isotopic materials can also be used for labeling, 

and may be introduced internally into the sequence or at the 
end of the sequence. Modified nucleotides may be 
incorporated enzymatically or chemically, and chemical 
modifications of the probe may be performed during or after 

25 synthesis of the probe, for example, by the use of non- 
nucleotide linker groups. Non-isotopic labels include 
fluorescent molecules, chemilumine scent molecules, enzymes, 
cof actors, enzyme substrates, haptens or other ligands. 
In a preferred embodiment of the invention, the length of an 

30 oligonucleotide probe is greater than or equal to about 18 
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nucleotides and less than or equal to about 50 nucleotides. 
Labeling of an oligonucleotide of the present invention may 
be performed enzymatically using [^^P] -labeled ATP and the 
enzyme T4 polynucleotide kinase. 

5 In Silico Methods 

The invention further comprises in silico methods, or 
computer-based methods, wherein molecules that are 
structurally related, or structurally and functionally 
related to SEQ ID NO:l and/or SEQ ID NO: 2 are identified. In 

10 one embodiment, the method comprises comparing SEQ ID NO:l 
or SEQ ID NO: 2 against a database of nucleic acid sequences, 
or protein sequences (e.g. GenBank) to identify molecules 
that are homologous or related to said SEQ ID NO:l and /or 
SEQ ID NO: 2. For this purpose, the skilled artisan 

15 understands that any suitable search or comparison algorithm 
can be used to identify sequences that are related to SEQ ID 
NO:l and/or SEQ ID NO: 2. For exanqple, suitable algorithms 
include dyneunic programming algorithms. Hidden Markov Model 
algorithms, such as for example, the BLAST algorithm, or the 

20 Smith- Waterman algorithm, or FASTA. 

For example BLAST is used to search for local sequence 
alignments. BLAST produces alignments of both nucleotide cucid 
amino acid sequences. The fundamental unit of the BLAST 
output is the so-called High- scoring Sequence Pair (HSP) . An 

25 HSP consists of two sequence fragments of arbitrary but 
equal lengths whose alignment is locally maximal. BLAST 
looks for HSPs between a query sequence and a database 
sequence and retains those comparisons that exceed a user- 
selected statistical significance threshold. The E parameter 
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establishes the upper boiind of the expected frequency of a 
chcucice occurrence of an HSP- 

For example, a mathematical algorithm utilized for the 
comparison of two sequences is described in Karlin and 

5 Altschul (Proc. Nat. Acad. Sci., 87, 2264, 1990; herein 
incorporated by reference) . Such an algorithm is 
incorporated in to the NBLAST and XBIAST programs of 
Altschul et al. J. Mol. Biol., 215, 403 (1990). BLAST 
nucleotide searches can be performed with the NBLAST program 

10 with parameters set to, for example, score = 100, word 
length =12, to identify nucleic acid sequences that are 
homologous, or related to Scarf ace 1. Protein searches can 
be performed using the XBLAST program with parameters set 
to, for example, score = 50, word length = 3, to identify 

15 amino acid sequences that are homologous to SEQ ID NO: 2. To 
obtain gapped alignments. Gapped BLAST can be used, as 
described in Altschul et al. Nuc. Acids Res., 25, 3389 
(1997) ; all of which are herein incorporated by reference. 
Alternatively, PSI -BLAST can be used to perform an iterated 

20 search which detects distant relationships between 

molecules. If desired, the default parameters may be used in 
any of the above described algorithms. Alternatively, the 
ALIGN program, which is part of the GCG sequence alignment 
software, can be used to detect relationships. When using 

25 ALIGN for comparing amino acid sequences, the following 
parameters may be used: PAM120 weight residue table; gap 
length penalty 12, and gap penalty 4. 
Vectors 



30 recombinant DNA cloning vectors and expression vectors 



Another aspect of the present invention relates to 
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comprising the nucleic acids of the present invention. The 
preferred nucleic acid vectors are those which comprise DNA, 
for example, SEQ ID NO:l. 

The skilled artisan understands that choosing the 

5 most appropriate cloning vector or expression vector depends 
upon a number of factors including the availability of 
restriction enzyme sites, the type of host cell into which 
the vector is to be transfected or transformed, the purpose 
of the transfection or transformation (e.g., stable 

10 trcuisf ormation as an extrachromosomal element, or 

integration into the host chromosome) , the presence or 
absence of readily assayable or selectable markers (e.g., 
antibiotic resistance and metabolic markers of one type and 
another) , and the number of copies of the gene desired in 

15 the host cell. 

Vectors suitable to carry the nucleic acids of the 
present invention comprise RNA viruses, DNA viruses, lytic 
bacteriophages, lysogenic bacteriophages, stable 
bacteriophages, plasmids, viroids, and the like. The most 

20 preferred vectors are plasmids . 

When preparing an expression vector the skilled 
artisan understands that there are many variables to be 
considered, for example, whether to use a constitutive or 
inducible promoter. The practitioner also understeuids that 

25 the amount of nucleic acid or protein to be produced 

dictates, in part, the selection of the expression system. 
Regarding promoter sequences, inducible promoters are 
preferred because they enable high level, regulatable 
expression of an operably- linked gene. The skilled artisan 

30 will recognize a number of suitable promoters that respond 
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to a variety of inducers, for example, carbon source, metal 
ions, and heat. Other relevant considerations regarding an 
expression vector include whether to include sequences for 
directing the localization of a recombinamt protein. For 
example, a sequence encoding a signal peptide preceding the 
coding region of a gene is useful for directing the extra- 
cellular export of a resulting polypeptide. 

The present invention also provides a method for 
constructing a recombinant host cell capable of expressing 
the proteins of the invention, said method comprising 
transforming or otherwise introducing into a host cell a 
recombinant DNA vector that comprises an isolated DNA 
sec[uence that encodes a protein of this invention. A 
suitable host cell is any eucaryotic cell that can 
accomodate high level expression of an exogenously 
introduced gene or protein, and that will incorporate said 
protein into its membrane structure . Vectors for expression 
are those which comprise an Scarface 1 gene or fragment 
thereof, e.g. SEQ ID NO:l. Transformed host cells may be 
cultured irnder conditions well known to skilled artisans 
such that a protein of the present invention is expressed, 
thereby producing a recombinant Scarface 1 protein in the 
recombinant host cell . 

For the purpose of identifying compounds having 
utility as modifiers of angiogenesis, it would be desirable 
to identify compoimds that bind an Scarface 1 protein of the 
present invention, and/or modify its activity. A method for 
determining agents that bind the Scarface 1 protein cuid/or 
inhibit its activity, conqprises contacting the Scarface 1 
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protein with a test compound and monitoring binding by any 
suitable meems . 

The instant invention provides a screening system 
for discovering compounds that bind the Scarface 1 protein, 
5 said screening system comprising the steps of : 

a) preparing Scarface 1 protein; 

b) exposing said Scarface 1 protein to a test 
compound; 

c) gucuitifying the binding of said compound to 

10 Scarface 1 protein by any suitable rneams, for 

example, by monitoring a change in the ability of 
Scarface 1 to phosphorylate a receptor, e.g. TIE 
2. 

Utilization of the screening system described 
15 above provides a meeuis to identify compotinds that may alter 
the biological function of Scarface 1 proteins. This 
screening method may be adapted to large-scale, automated 

procedures such as a PANDEX® (Baxter-Dade Diagnostics) 
system, allowing for efficient high-volume screening for 

20 potential therapeutic agents. 

In one embodiment, Scarface 1 is prepared as 
described herein, preferably using recombinant DMA 
technology. A test compound is introduced into a reaction 
vessel containing an Scarface 1 protein or fragment thereof. 

25 Binding of Scarface 1 by a test compound is determined by 
any suitable means. For example, in one method 
radioactively- labeled or chemically- labeled test compound 
may be used. Binding of the protein by the test compound is 
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assessed, for example, by quantifying boiind label versus 
unbound label using any suitable method. 

Binding of a test compound may also be carried out 
by a method disclosed in U.S. Patent 5,585,277, which hereby 

5 is incorporated by reference. In this method, binding of a 
test compound to a protein is assessed by monitoring the 
ratio of folded protein to unfolded protein, for example, by 
monitoring sensitivity of said protein to a protease, or 
amenability for binding of said protein by a specific 

10 antibody against the folded state of the protein. 

The foregoing screening methods are useful for 
identifying an antagonist of a Scarface 1 protein as a lead 
to a pharmaceutical compoiind for treating cancer. For 
example, a compound that binds Scarface 1, or related 

15 fragment thereof, is identified by combining a test compound 
with Scarface 1 under conditions that cause the protein to 
exist in a ratio of folded to unfolded states . If a test 
compound binds the folded state of the protein, the relative 
amount of folded protein will be higher than in the case of 

20 a test compound that does not bind the protein. The ratio of 
protein in the folded versus unfolded state is easily 
detezrminable by, for exanple, susceptibility to digestion by 
a protease, or binding to a specific suitibody, or binding to 
chaperonin protein, or binding to any suitable surface. 

25 

Therapeutic Applications 

In one embodiment, the present invention relates 
to therapeutic applications in which modulation of 
angiogenesis is therapeutically beneficial. For example. 
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Scar£ace 1 can be administered to enhance wound healing such 
as in post-surgical procedures. 

In another embodiment, the invention relates to 
amplification of stem cells. Amplification of stem cells is 

5 useful for a variety of clinically relevant procedures, 

including, for example, transfusion procedures, or promoting 
the effectiveness of bone marrow transplants. Because the 
TIE receptors are expressed in early hematopoietic cells. 
Scarf ace 1 is expected to play a compareUsle role in the 

10 proliferation or differentiation or migration of these 

cells. Thus, for exair^le, TIE containing compositions may be 
prepared, assayed, examined in in vitro and in vivo 
biological systems and used therapeutically 
as described in any of the following: Sousa, U.S. Pat. No. 

15 4,810,643; Lee, et al., Proc. Natl. Acad. Sci. USA 82:4360- 
4364 (1985); Wong, et al. Science, 228:810-814 (1985); 
Yokota, et al, Proc. Natl. Acad. Sci (USA) 81:1070 (1984); 
Bosselman, et al. WO 9105795; and Kirkness, et al. WO 
95/19985, herein incorporated by reference. 

20 Accordingly, Scar face 1 may be used to diagnose or 

treat conditions in which normal hematopoiesis is 
suppressed, including, but not limited to anemia, 
thrombocytopenia, leukopenia and granulocytopenia. In a 
preferred embodiment. Scarf ace 1 may be used to stimulate 

25 differentiation of blood cell precursors in clinical 

situations where for example a patient has a disease, such 
as acq[uired immune deficiency syndrome (AIDS) which has 
caused a reduction in normal blood cell levels, or in 
clinical settings in which enhancement of hematopoietic 

30 populations is desired, such as in conjxmction with bone 
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marrow transplant, or in the treatment of aplasia or 
myelosuppression caused by radiation, chemical 
treatment or chemotherapy. Thus, the present invention 
provides a method for amplifying stem cells, in vivo or in 

5 vitro, from a population of cells that contain said stem 
cells. In one embodiment of this aspect, the invention 
comprises administering Scarface 1 peptide to amplify 
hematopoietic stem cells, in vivo or in vitro. For example, 
stem cell amplification may be achieved in vitro by exposure 

10 of bone marrow derived cells to Scarface 1 peptide. 

The Scarface 1 of the present invention may be used 
alone, or in combination with another pharmaceutically 
active agent such as, for example, ctyokines, neurotrophins , 
interleukins , etc. In a preferred embodiment, the ligand may 

15 be used in conjunction with any of a number of the above 
referenced factors which are known to induce stem cell or 
other hematopoietic precursor proliferation, or factors 
acting on later cells in the hematopoietic pathway, 
including, but not limited to, hemopoietic maturation 

20 factor, thrombopoietin, stem cell factor, erythropoietin, G- 
CSF, GM-CSF, etc. 

The Scarface 1 proteins, peptide fragments, or 
derivatives may be administered systemically or locally. Any 
appropriate mode of administration known in the art may be 

25 used, including, but not limited to, intravenous, 

intrathecal, intraarterial, intranasal, oral, stibcutaineous , 
intraperitoneal, or by local injection or surgical implauxt. 
Sustained release formulations are also provided for. 



30 treating cancer and for inhibiting tumor growth, in vitro or 



The present invention also provides methods for 
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in vivo comprising administration of an effective amoiint of 
an Scarf ace 1 antagonist. 



in which enhanced angiogenesis is advamtageous , an effective 
5 amount of Scarface 1 protein is administered to an organism 
in need thereof in a dose between aibout 1 and 1000 ug/kg 
body weight. In practicing the methods contemplated, 
Scarface 1 can be administered in a single daily dose or in 
multiple doses per day. The amount per administration will 
10 be determined by the physician and depend on such factors as 
the nature and severity of the disease, and the age cuid 
general health of the patient. 



pharmaceutical composition comprising as the active agent a 
15 Scarface 1 polypeptide or fragment, or a phaintnaceutically 
acceptable non- toxic salt thereof, and a pharmaceutically 
acceptable solid or liquid carrier. For example, compounds 
comprising Scarface 1 can be admixed with conventional 
pharmaceutical carriers and excipients, and used in the form 
20 of tablets, capsules, elixirs, suspensions, syrups, wafers, 
and the like. The compositions comprising Scarface 1 will 
contain from about 0.1% to 90% by weight of the active 
compound, and more generally from about 10% to 30%. The 
compositions may contain common carriers and excipients such 
25 as com starch or gelatin, lactose, sucrose, 

microcrystalline cellulose, kaolin, mannitol, dicalcixim 
phosphate, sodium chloride, and alginic acid. The compounds 
can be formulated for oral or parenteral administration. 



30 is administered in commonly used intravenous fluid (s) cuid 



For therapeutic use in wound healing or other use 



The present invention also provides a 



For intravenous (IV) use, the Scarface 1 protein 



• 
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administered by infusion. Such fluids ^ for example, 
physiological saline. Ringer's solution or 5% dextrose 



solution can be used. 



For intramuscular preparations, a sterile 



5 formulation, preferably a suitable soluble salt form of the 
Scarf ace 1 protein, for example SEQ ID NO: 2, such as the 
hydrochloride salt, can be dissolved and administered in a 
pharmaceutical diluent such as pyrogen- free water 
(distilled) , physiological saline or 5% glucose solution. A 

10 suitable insoluble form of the compound may be prepared eoid 
administered as a suspension in an aqueous base or a 
pharmaceutically acceptable oil base, e.g. an ester of a 
long chain fatty acid such as ethyl oleate. 



15 are dependent on the selectivity of the compound tested. 
For example, a compound with an IC50 which is less than 10 
nM is generally considered an excellent candidate for drug 
therapy. However, a compoxind which has a lower affinity, 
but is selective for a particular target, may be an even 

20 better ceuididate. The skilled artisan will recognize that 
any information regarding the binding potential, inhibitory 
activity, or selectivity of a particular compound is useful 
toward the development of pharmaceutical products. 



25 present invention. Those skilled in the art will recognize 
that the particular reagents, equipment, and procedures 
described are merely illustrative and are not intended to 
limit the present invention in any manner. 



Skilled artisans will recognize that IC50 values 



The following examples more fully describe the 



30 



EXAMPLE 1 
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RT-PCR Amplification of Scarf ace 1 from mRNA 
The Scarface 1 gene is isolated by reverse 
transcriptase PGR (RT-PCR) using conventional methods. Total 
RNA from a tissue that expresses the Scarface 1 gene, for 

5 excimple, muscle tissue, or reproductive tissues, is prepared 
using standard methods. First strand cDNA synthesis is 
achieved using a commercially available kit (Superscript^** 
System; Life Technologies) by PGR in conjunction with 
specific primers directed at any suitable region of SEQ ID 

10 NO:l. 

Amplification is carried out by adding to the 
first strand cDNA (dried under vacuum) : 8 nl of lOX 
synthesis buffer (200 mM Tris-HCl, pH 8.4; 500 mM KCl, 25 mM 
MgCl^, 1 ug/ul BSA) ; 68 |al distilled water; 1 jxl each of a 

15 10 uM solution of each primer; and 1 \il Taq DNA polymerase 
(2 to 5 U/nD . The reaction is heated at 94° C for 5 min. to 
denature the RNA/cDNA hybrid. Then, 15 to 30 cycles of PGR 
amplification are performed using any suitable thermal cycle 
apparatus. The amplified sample may be analyzed by agarose 

20 gel electrophoresis to check for an appropriately- sized 
fragment . 

EXAMPLE 2 

Production of a Vector for Expressing Scarface 1 in a Host 
25 Cell 

An expression vector suitable for expressing 
Scarface 1 or fragment thereof in a variety of procaryotic 
host cells, such as E. coli is easily made. The vector 
contains an origin of replication (Ori) , an ampicillin 
30 resistance gene (Amp) useful for selecting cells which have 
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incorporated the vector following a tranf ormation procedure, 
and further comprises the T7 promoter and T7 terminator 
sequences in operable linkage to a Scarf ace 1 coding region. 
Plasmid pETllA (obtained from Novogen, Madison WI) is a 

5 suitable parent plasmid. pETllA is linearized by restriction 
with endonucleases Ndel and BamHI. Linearized pETllA is 
ligated to a DNA fragment bearing Ndel and BaznHI sticky ends 
cuid comprising the coding region of the Scarface 1 gene as 
disclosed by SEQ ID NO : 1 . 

10 The Scarface 1 gene used in this construction may 

be slightly modified at the 5' end (amino terminus of 
encoded protein) in order to simplify purification of the 
encoded protein product. For this purpose, an 
oligonucleotide encoding 8 histidine residues is inserted at 

15 or near the 5' end of SEQ ID NO:l, so as to maintain the 
correct reading frame. Placement of the histidine residues 
at the amino terminus of the encoded protein serves to 
enable the IMAC one- step protein purification procedure. 

20 EXAMPLE 3 

Recombinant Expression and Purification of Scarface 1 

Protein 

An expression vector that carries a gene encoding 
Scarface 1 or fragment thereof and which gene is operably- 

25 linked to an expression promoter is transformed into E. coli 
BL21 (DE3) {hsdS gal lclts857 indlSam7nin5lacUV5-T7gene 1) 
using standard methods. Trsuisf ormants, selected for 
resistance to ampicillin, are chosen at random auid tested 
for the presence of the vector by agarose gel 

30 electrophoresis using cjuick plasmid preparations. Colonies 



# 
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which contain the vector are grown in L broth and the 
protein product encoded by the vector-borne gene is purified 
by immobilized metal ion affinity chromatography (IMAC) , 
essentially as described in US Patent 4,569,794. 

5 Briefly, the IMAC column is prepared as follows. A 

metal -free chelating resin (e.g. Sepharose 6B IDA, 
Pharmacia) is washed in distilled water to remove 
preservative substeuices and infused with a suitable metal 
ion [e.g. Ni(II), Co(II), or Cu(II)] by adding a 50mM metal 

10 chloride or metal sulfate aqueous solution until about 75% 
of the interstitial spaces of the resin are saturated with 
colored metal ion. The column is then ready to receive a 
crude cellular extract containing the recombinant protein 
product . 

15 After removing unbound proteins and other materials by 

washing the column with any suitable buffer, pH 7,5, the 
bound protein is eluted in amy suitable buffer at pH 4.3, or 
preferably with an imidizole- containing buffer at pH 7.5. 



20 

EXAMPLE 4 

Tissue Distributuion of Scarface 1 mRNA 
Scarface 1 mRNA was detected in a mammalian tissue by 
Northern analysis. Total RNA from different tissues was 
25 isolated by a standard guanidine chloride /phenol extraction 
method, and poly-A* RNA isolated using oligo (dT) -cellulose 
type 7 (Pharmacia) . Electrophoresis of RNA samples was 
carried out in formaldehyde followed by capillary transfer 
to Zeta-Probe™ nylon membranes (Bio-Rad, Hercules, Calif.). 
30 SEQ ID NO:l was used as a template for generating probes 
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using a MultiPrime"^** random priming kit (Amersham, Arlington 
Heights, 111.) . The efficiency of the labeling reaction was 
approximately 4 x 10^° cpm incorporated per ^g of template. 
The hybridization buffer contained 0,SM sodixim phosphate, 7% 

5 SDS (wt/vol) , 1% BSA (wt/vol) , and 1 mM EDTA. 

Prehybridization was carried out in hybridization buffer at 
65** C for 2 h, and ^^P- labeled probe was added and incubation 
continued overnight. Filters were washed in Buffer A (40 xnM 
sodium phosphate pH 7.2, 5% SDS [wt/vol], 0.5% BSA [wt/vol], 

10 and 1 mM EDTA) at 65'' C for 1 h, then in Buffer B (40 mM 

sodium phosphate, pH 7.2, 1% SDS [wt/vol], and 1 mM EDTA) at 
65^ C for 20 min. Washed filters were air-dried and exposed 
to Kodak X-OMAT AR film at -80** C with axi intensifying 
screen. The results (see Figure 1) demonstrate that Scarf ace 

15 1 mRNA is expressed in a plurality of tissues and that two 
major species are observed, one approximately 2.4 Kb in size 
and the other about 4 Kb. 



The wells of an ELISA plate are coated with chaperonin 
by incubation for several hours with a 4 ug/ml solution of 
the protein in Tris-buf f ered Saline (TBS: 10 mM Tris-HCl, 

25 pH7.5, 0.2M NaCl) . Plates are washed 3 times with TBS 

containing 0.1% Tween-20 (TBST) . Then, a mixture of Scarf ace 
1 protein (sufficient amotint to saturate about 50% of the 
binding sites on chaperonin) and test compound (10'^ to 10'^ 
M) in about 50 ^l volume is added to each well of the plate 

30 for eui incubation of about 60 minutes. Aliguots from each 



EXAMPLE 5 



Detecting Ligsmds that Bind Scarface 1 Using a 
Chaperonin Protein Assay 
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well are then transferred to the wells of fresh plates and 
incubation is continued for 60 minutes at room temperature, 
followed by 3 washes with TEST. Next, about 50 ^il of an 
antibody specific for Scarface 1 in 5% nonfat dry milk is 

5 added to each well for 3 0 minutes at room temperature. After 
washing, about 50 }il of goat anti- rabbit IgG alkaline 
phosphatase conjugate at an appropriate dilution in TBST 
plus 5% nonfat dry milk is added to each will and incubated 
30 minutes at room temperature. The plates are washed again 

10 with TBST, and 0 . 1 ml of 1 mg/ml p--nitrophenylphosphate in 
0.1% diethanolamine is added. Color development 
(proportional to boxind alkaline phosphatase amtibody 
conjugate) is monitored with an ELISA plate reader. When 
binding by the test ligand has occurred, ELISA analysis 

15 reveals Scarface 1 in solution at higher concentrations thcux 
in the absence of test ligaund. 



thereof or analog thereof is isolated from transfected or 

transformed cells using any of the well known methods in the 

art, or by a method specifically disclosed herein. 

Concentration of protein is adjusted, for example, by 
25 filtration through an Amicon filter device such that the 

level is about 1 to 5 ug/ml. Monoclonal or polyclonal 

antibody is prepared as follows. 

Monoclonal antibody can be prepared from murine 

hybridomas according to the method of Kohler and Milstein 
30 (Nature, 256, 495, 1975), or a modified method thereof. 



EXT^PLE 6 



Production of an Antibody to Scarface 1 Protein 
Sxibstantially pure Scarface 1 protein or fragment 
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Briefly, a mouse is repetitively inoculated with a few 
micrograms of the Scarf ace 1 protein (e.g. SEQ ID NO: 2) or 
fragment thereof, or analog thereof, over a period of a few 
weeks. The mouse is then sacrificed, and antibody producing 

5 cells of the spleen are isolated. Spleen cells are fused 
with mouse myeloma cells using polyethylene glycol . Fused 
cells that produce antibody specific for Scarface 1 are 
identified by any suitable immunoassay, for example, ELISA, 
as described in E. Engvall, Meth. Enzymol., 70, 419, 1980. 

10 Polyclonal antiserum can be prepared by well knovm 

methods {See e.g. J. Vaitukaitis et.al. Clin. Endocimol. 
Metab. 33, 988, 1971) that involve immxinizing suitable 
animals with a Scarface 1 protein, fragments thereof, or 
fusion proteins thereof. Small doses (e.g. nanogram amounts) 

15 of antigen administered at multiple intradermal sites are 
the most reliable method. 



specific phosphorylation, essentially as described in S. 
Davis at al.. Cell, 87, 1161-1169, 1996; and T. Stitt at. 
al. Cell, 80, 661-670, 1995, both of which are herein 
incorporated by reference. Briefly, assays are done using 

25 late-conf luency T-75 flasks of human umbilical vein 

endothelial cells (available from Clonetics, Inc. San Diego, 
CA) . Cells are serum starved in high glucose DMDM for 2-4 
hr« Conditioned mediiun, plus and minus varying amounts of 
Scarface 1 protein from about 10 ng/ml to about 10 ug/ml, is 

30 placed on cells for 5-10 min. at 37®C. Cells are then 



EXAMPLE 7 



Assay For Scarface 1 Activity 
Scarface 1 activity is assayed by measuring tyrosine- 
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lysed in 1% NP-40 in PBS containing 1 mM PMSF, 0.14 U/ml 
aprotinin, 1 mM EDTA, and 1 mM sodium orthovanadate . The 
lysates are immunoprecipitated with a phosphotyrosine 
specific rabbit antiserum, elect rophoresed, and 
5 immunoblotted (Western Blot) to nylon membranes. Following 
incLibation of the blots with HRP- conjugated goat anti- rabbit 
eintibodies ( Caltag, Inc.) bands are visualized by ECL. 



Assay reactions are set up essentially as described in 
Example 7, accept that a compo\ind to be tested for Scarf ace 
1 antagonist activity is included at the step of adding 
Scarf ace 1 to conditioned medium for 5-10 min. prior to 
IS cell lysis . Multiple assays can be set up in which a 

constant cmiount of Scarface 1 is incubated with varying 
amounts of test compound, for example from about 10 ng/ml to 
about 100 ug/ml . 



EXAMPLE 8 



10 



Assay For Scarface 1 Antagonist 



20 
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WE CLAIM: 



1 . A siibstantially pure protein having an amino acid 
sequence that is at least 70% identical to SEQ ID NO: 2, 
5 said protein exhibiting Scarf ace 1-like activity in vivo or 
in vitro. 

2 « A substantially pure protein having an amino acid 
sequence that is encoded by a nucleic acid sequence that 
10 hybridizes to SEQ ID NO:l under high stringency conditions 
said protein having Scarface 1 activity. 

3 • A substantially pure protein having an amino acid 
sequence that is encoded by a nucleic acid sequence that 
15 hybridizes to SEQ ID NO:l under high stringency conditions 
said protein having Scarface 1 activity and containing a 
fibrinogen -like domain. 



20 amino acid sequence identified herein as SEQ ID NO: 2. 

5. A functional fragment of a Scarface 1 protein or 
analog comprising a fibrinogen- like domain said domain 
having an amino acid sequence that is structurally and 
25 functionally related to residues from about 275 through 
about 493 of SEQ ID NO: 2. 



4 . A substantially pure Scarface 1 protein having an 



6 . A functional fragment of a Scarface 1 comprising a 
fibrinogen- like domain said domain having an amino acid 
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sequence that is defined by residues from about 275 through 
about 493 of SEQ ID NO: 2. 

7. A functional fragment of a Scarf ace 1 or analog 

5 comprising a coiled-coil domain said domain having an amino 
acid sequence that is structurally cuid functionally related 
to residues from about 1 through about 274 of SEQ ID NO: 2. 

8. A functional fragment of a Scarf ace 1 comprising a 

10 coiled- coil domain said domain having an amino acid sequence 
that is defined by residues from about 1 through about 274 
of SEQ ID NO: 2. 



15 partner . 

10 . A fusion protein as in Claim 9 wherein said fusion 
partner comprises a constant region of an immunoglobulin. 



9. A fusion protein comprising Scarf ace 1 and a fusion 



20 



11. A fusion protein as in Claim 10 wherein said 
Scar face 1 comprises a fibrinogen- like domain of SEQ ID 



NO:2. 



12 . An isolated nucleic acid compound encoding a 
25 protein of Claim 1 , Claim 2 , or Claim 3 . 



13 . An isolated nucleic acid compound that encodes a 
protein of Claim 4 . 
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14 • An Isolated nucleic acid compound that encodes a 
protein of Claim 5 wherein said nucleic acid is SEQ ID NO:l. 



5 peptide of Claim 6, Claim 7, Claim 8, or Claim 9. 

16, A vector comprising an isolated nucleic acid 
compoxind encoding a Scarf ace 1* 



18. A method for constructing a recombinant host cell 
having the potential to esqpress an Scarf ace 1 protein, said 
method comprising introducing into said host cell by any 

15 suitable means a vector of Claim 16. 

19. A method for expressing an Scarf ace 1 protein in a 
recombinant host cell of Claim 18, said method comprising 
culturing said recombinant host cell under conditions 

20 suitable for gene expression. 

20. A method for identifying compounds that bind a 
Scarf ace 1 protein, comprising the steps of: 

a) admixing a substantially purified preparation of a 
25 Scar face 1 protein with a test compound; and 



15 . An isolated nucleic acid compound encoding a 



10 



17. A host cell containing a vector of Claim 16. 



b) monitoring by any suitable mesms a binding 

interaction between said protein and said compoxmd. 



21. An antibody that selectively binds a protein 
30 identified herein as SEQ ID NO: 2. 
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22 . A pharmaceutical formulation comprising as an 
active ingredient an Scarf ace 1 protein or peptide, 
associated with one or more pharmaceutically acceptable 
carriers, excipients, or diluents thereof. 

23 . A pharmaceutical formulation comprising as an 
active ingredient an agonist or antagonist of axx Scarf ace 1, 
associated with one or more pharmaceutically acceptable 
carriers, excipients, or diluents thereof. 

24. A method for identifying an Scarf ace 1 antagonist 
comprising the steps of : 

a) admixing a substantially purified preparation of 
Scarf ace 1 with a test cont^otind; and 

b) monitoring, by any suitable meauis, an inhibition in 
the biological activity of Scarf ace 1. 

25. A method for identifying an Scar face 1 agonist 
comprising the steps of : 

a) admixing a substantially purified preparation 
of Scar face 1 with a test compound; and 

b) monitoring, by any suitable means, an 
enhancement in the biological activity of Scarface 
1. 

26. A vector comprising an isolated nucleic acid 
compound comprising SEQ ID NO:l. 



27. A host cell containing a vector of Claim 26. 
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28. A method for inhibiting tissue growth by 
administration of a therapeutically effective amount of an 
Scarf ace 1 antagonist . 

29. A method for stimulating tissue growth by 
administration of an effective amount of Scarf ace 1. 

30. A method for axiqplifying stem cells comprising the 
step of administration of Scarface 1 to a population of 
cells that comprise said stem cells. 

31. A method for expanding hematopoietic stem cells 
coznprising the step of administration of Scarface 1 to a 
population of bone marrow cells. 

32 . A method for identifying a nucleic acid that 
encodes a protein that is structurally and functionally 
related to Scarface 1, comprising the steps of: 

a) hybridizing SEQ ID N0;1, or fragment thereof, to a 
saxnple comprising one or more nucleic acids under 
high stringency conditions; and 

b) detecting nucleic acids that hybridize to said SEQ 
ID NO:l or fragment thereof. 

33. A method for identifying a nucleic acid that is 
structurally similar to SEQ ID NO:l, comprising the steps 
of: 

a) comparing the secjuence of SEQ ID NO:l or fragment 
thereof or siib- region thereof with a database 
comprising one or more nucleic acids; and 
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b) identifying by any suitable algorithm segiiences 
from said database that are homologous to SEQ ID 
N0:1 or fragment thereof or sub-region thereof, 

34. A method for identifying a protein that is structurally 
and fimctionally similar to Scarf ace 1, comprising the steps 
of: 

a) comparing a database comprising one or more 
proteins with SEQ ID NO: 2, or fragment thereof; and 

b) identifying by any suitable algorithm proteins from 
said database that are similar to Scarf ace 1. 

35* An isolated nucleic acid that encodes a protein having 
Scarface 1 activity, said nucleic acid hybridizing to SEQ ID 
NO:l iinder high stringency conditions. 
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SEQUENCE LISTING 

<110> Burgett, Stanley G. 
Little, Rebecca A. 
Rosteck Jr., Paul R. 
Heuer, Josef G. 
Ssuikhavaram, Patanjali R. 

<120> T^giopoietin Related Gene Sequence Scarface 1 

<130> X-12260 

<140> 
<141> 

<160> 2 

<170> Patentin Ver. 2.0 

<210> 1 

<211> 1479 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (I),. (1479) 

<400> 1 

atg agg cca ctg tgc gtg aca tgc tgg tgg etc gga ctg ctg get gcc 48 
Met Arg Pro Leu Cys Val Thr Cys Trp Trp Leu Gly Leu Leu Ala Ala 
15 10 15 

atg gga get gtt gea ggc cag gag gac ggt ttt gag ggc act gag gag 96 
Met Gly Ala Val Ala Gly Gin Glu Asp Gly Phe Glu Gly Thr Glu Glu 
20 25 30 



ggc teg cca aga gag tte att tac eta aae agg tae aag egg geg ggc 
Gly Ser Pro Arg Glu Phe lie Tyr Leu Asn Arg Tyr Lys Arg Ala Gly 
35 40 45 



144 



gag tec cag gac aag tgc ace tae ace tte att gtg cec eag cag egg 192 
Glu Ser Gin Asp Lys Cys Thr Tyr Thr Phe lie Val Pro Gin Gin Arg 
50 55 60 



gtc aeg ggt gcc ate tgc gtc aae tec aag gag cet gag gtg ctt ctg 
Val Thr Gly Ala lie Cys Val Asn Ser Lys Glu Pro Glu Val Leu Leu 
65 70 75 80 

gag aae cga gtg cat aag cag gag eta gag ctg etc aae aat gag ctg 
Glu Asn Arg Val His Lys Gin Glu Leu Glu Leu Leu Asn Asn Glu Leu 
85 90 95 



240 



288 



etc aag cag aag egg eag ate gag aeg ctg cag cag ctg gtg gag gtg 336 
Leu Lys Gin Lys Arg Gin lie Glu Thr Leu Gin Gin Leu Val Glu Val 
100 105 110 



gac ggc ggc att gtg age gag gtg aag ctg ctg cge aag gag age cge 



384 



# 
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Asp Gly Gly lie Val Ser Glu Val Lys Leu Leu Arg Lys Glu Ser Arg 
115 120 125 

aac atg aac teg egg gtc acg cag etc tae atg cag etc ctg cae gag 432 
Asn Met Asn Ser Arg Val Thr Gin Leu Tyr Met Gin Leu Leu His Glu 
130 135 140 

ate ate cgc aag egg gae aae gcg ttg gag etc tee cag ctg gag aac 480 
lie lie Arg Lys Arg Asp Asn Ala Leu Glu Leu Ser Gin Leu Glu Asn 
145 150 155 160 

agg ate ctg aae cag aea gee gae atg ctg cag ctg gee age aag tac 528 
Arg lie Leu Asn Gin Thr Ala Asp Met Leu Gin Leu Ala Ser Lys Tyr 
165 170 175 



aag gae ctg gag cac aag tac cag cae ctg gee aca ctg gcc cae aac 576 
Lys Asp Leu Glu His Lys Tyr Gin His Leu Ala Thr Leu Ala His Asn 
180 185 190 

caa tea gag ate ate gcg cag ctt gag gag cac tgc cag agg gtg ccc 624 
Gin Ser Glu lie lie Ala Gin Leu Glu Glu His Cys Gin Arg Val Pro 
195 200 205 

teg gcc agg ccc gtc ccc cag cca ccc ccc get gcc ccg ccc egg gtc 672 
Ser Ala Arg Pro Val Pro Gin Pro Pro Pro Ala Ala Pro Pro Arg Val 
210 215 220 

tac caa cca ccc acc tac aac cgc ate ate aac cag ate tct ace aac 720 
Tyr Gin Pro Pro Thr Tyr Asn Arg lie lie Asn Gin lie Ser Thr Asn 
225 230 235 240 

gag ate cag agt gae cag aac ctg aag gtg ctg cca ccc cet ctg ccc 768 
Glu lie Gin Ser Asp Gin Asn Leu Lys Val Leu Pro Pro Pro Leu Pro 
245 250 255 

act atg ccc act etc acc age etc cca tct tec ace gae aag ccg teg 816 
Thr Met Pro Thr Leu Thr Ser Leu Pro Ser Ser Thr Asp Lys Pro Ser 
260 265 270 

gge cca tgg aga gae tgc ctg cag gee ctg gag gat gge cac gae ace 864 
Gly Pro Trp Arg Asp Cys Leu Gin Ala Leu Glu Asp Gly His Asp Thr 
275 280 285 

age tec ate tae ctg gtg aag ccg gag aac ace aae cgc etc atg cag 912 
Ser Ser He Tyr Leu Val Lys Pro Glu Asn Thr Asn Arg Leu Met Gin 
290 295 300 

Srtg tgg tgc gae cag aga cae gae ccc ggg gge tgg ace gtc ate cag 960 
Val Trp Cys Asp Gin Arg His Asp Pro Gly Gly Trp Thr Val He Gin 
305 310 315 320 

aga cgc ctg gat gge tct gtt aae tte ttc agg aae tgg gag acg tac 
1008 

Arg Arg Leu Asp Gly Ser Val Asn Phe Phe Arg Asn Trp Glu Thr Tyr 

325 330 335 

aag caa ggg ttt ggg aac att gae gge gaa tae tgg ctg gge ctg gag 
1056 
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Lys Gin Gly Phe Gly Asn He Asp Gly Glu Tyr Trp Leu Gly Leu Glu 
340 345 350 

aac att tac tgg ctg acg aac caa ggc aac tac aaa etc ctg gtg acc 
1104 

Asn He Tyr Trp Leu Thr Asn Gin Gly Asn Tyr Lys Leu Leu Val Thr 
355 360 365 

atg gag gac tgg tec ggc cgc aaa gtc ttt gca gaa tac gcc agt ttc 
1152 

Met Glu Asp Trp Ser Gly Arg Lys Val Phe Ala Glu Tyr Ala Ser Phe 
370 375 380 

cgc ctg gaa cct gag age gag tat tat aag ctg egg ctg ggg cgc tac 
1200 

Arg Leu Glu Pro Glu Ser Glu Tyr Tyr Lys Leu Arg Leu Gly Arg Tyr 
385 390 395 400 

cat ggc aat gcg ggt gac tec ttt aca tgg cac aac ggc aag cag ttc 
1248 

His Gly Asn Ala Gly Asp Ser Phe Thr Trp His Asn Gly Lys Gin Phe 
405 410 415 

acc acc ctg gac aga gat cat gat gtc tac aca gga aac tgt gcc cac 
1296 

Thr Thr Leu Asp Arg Asp His Asp Val Tyr Thr Gly Asn Cys Ala His 
420 425 430 

tac cag aag gga ggc tgg tgg tat aac gee tgt gcc cac tec aac etc 
1344 

Tyr Gin Lys Gly Gly Trp Trp Tyr Asn Ala Cys Ala His Ser Asn Leu 
435 440 445 

aac. ggg gtc tgg tac cgc ggg ggc eat tac egg age cgc tac cag gac 
1392 

Asn Gly Val Trp Tyr Arg Gly Gly His Tyr Arg Ser Arg Tyr Gin Asp 
450 455 460 

gga gtc tac tgg get gag ttc ega gga ggc tet tac tea etc aag aaa 

1440 

Gly Val Tyr Trp Ala Glu Phe Arg Gly Gly Ser Tyr Ser Leu Lys Lys 
465 470 475 480 

gtg gtg atg atg ate ega eeg aac ecc aac ace ttc cac 
1479 

Val Val Met Met He Arg Pro Asn Pro Asn Thr Phe His 
485 490 



<210> 2 

<211> 493 

<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Arg Pro Leu Cys Val Thr Cys Trp Trp Leu Gly Leu Leu Ala Ala 
15 10 15 



# 
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Met Gly Ala Val Ala Gly Gin Glu Asp Gly Phe Glu Gly Thr Glu Glu 
20 25 30 

Gly Ser Pro Arg Glu Phe lie Tyr Leu Asn Arg Tyr Lys Arg Ala Gly 
35 40 45 

Glu Ser Gin Asp Lys Cys Thr Tyr Thr Phe lie Val Pro Gin Gin Arg 
50 55 60 

Val Thr Gly Ala lie Cys Val Asn Ser Lys Glu Pro Glu Val Leu Leu 
65 70 75 80 

Glu Asn Arg Val His Lys Gin Glu Leu Glu Leu Leu Asn Asn Glu Leu 
85 90 95 

Leu Lys Gin Lys Arg Gin lie Glu Thr Leu Gin Gin Leu Val Glu Val 
100 105 110 

Asp Gly Gly lie Val Ser Glu Val Lys Leu Leu Arg Lys Glu Ser Arg 
115 120 125 

Asn Met Asn Ser Arg Val Thr Gin Leu Tyr Met Gin Leu Leu His Glu 
130 135 140 

lie lie T^g Lys Arg Asp Asn Ala Leu Glu Leu Ser Gin Leu Glu Asn 
145 150 155 160 

Arg lie Leu Asn Gin Thr Ala Asp Met Leu Gin Leu Ala Ser Lys Tyr 
165 170 175 

Lys Asp Leu Glu His Lys Tyr Gin His Leu Ala Thr Leu Ala His Asn 
180 185 190 

Gin Ser Glu lie lie Ala Gin Leu Glu Glu His Cys Gin Arg Val Pro 
195 200 205 

Ser Ala Arg Pro Val Pro Gin Pro Pro Pro Ala Ala Pro Pro Arg Val 
210 215 220 

Tyr Gin Pro Pro Thr Tyr Asn Arg lie He Asn Gin He Ser Thr Asn 
225 230 235 240 

Glu He Gin Ser Asp Gin Asn Leu Lys Val Leu Pro Pro Pro Leu Pro 
245 250 255 

Thr Met Pro Thr Leu Thr Ser Leu Pro Ser Ser Thr Asp Lys Pro Ser 
260 265 270 

Gly Pro Trp Arg Asp Cys Leu Gin Ala Leu Glu Asp Gly His Asp Thr 
275 280 285 

Ser Ser He Tyr Leu Val Lys Pro Glu Asn Thr Asn Arg Leu Met Gin 
290 295 300 

Val Trp Cys Asp Gin Arg His Asp Pro Gly Gly Trp Thr Val He Gin 
305 310 315 320 



Arg Arg Leu Asp Gly Ser Val Asn Phe Phe Arg Asn Trp Glu Thr Tyr 
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325 



330 



335 



Lys Gin Gly Phe Gly Asn He Asp Gly Glu Tyr Trp Leu Gly Leu Glu 
340 345 350 

Asn He Tyr Trp Leu Thr Asn Gin Gly Asn Tyr Lys Leu Leu Val Thr 
355 360 365 

Met Glu Asp Trp Ser Gly Arg Lys Val Phe Ala Glu Tyr Ala Ser Phe 
370 375 380 

Arg Leu Glu Pro Glu Ser Glu Tyr Tyr Lys Leu Arg Leu Gly Arg Tyr 
385 390 395 400 

His Gly Asn Ala Gly Asp Ser Phe Thr Trp His Asn Gly Lys Gin Phe 
405 410 415 

Thr Thr Leu Asp Arg Asp His Asp Val Tyr Thr Gly Asn Cys Ala His 
420 425 430 

Tyr Gin Lys Gly Gly Trp Trp Tyr Asn Ala Cys Ala His Ser Asn Leu 
435 440 445 

Asn Gly Val Trp Tyr Arg Gly Gly His Tyr Arg Ser Arg Tyr Gin Asp 
450 455 460 

Gly Val Tyr Trp Ala Glu Phe Arg Gly Gly Ser Tyr Ser Leu Lys Lys 
465 470 475 480 



Val Val Met Met He Arg Pro Asn Pro Asn Thr Phe His 
485 490 
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